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ABSTRACT 


This  final  report  presents  the  results  of  work  performed  under  RADC  contract  No. 
F30602-78-C-0274.yThe  effort  described  specifically  addressed  the  problem  of 
deriving  indicator  and  descriptor  data  from  the  narrative  text  portions  of  a  class  of 
intelligence  messages  dealing  with  events  related  to  missile  and  satellite  launchings 
used  for  input  to  the  Advanced  Indicator  System  (AIS)  data  base. 

The  introductory  section  briefly  discusses  the  intelligence  problem  which  OSI’s 
event  processing  technology  intends  to  solve,  summarizes  the  technology  developed 
under  this  contract,  and  presents  the  conclusions  drawn  on  the  basis  of  the  results 
obtained. 

Section  2  offers  a  summary  of  OSI’s  methodological  approach  to  the  analysis  and 
description  of  event  reports.  This  methodology,  initially  developed  on  the  basis  of 
messages  dealing  with  air  activities,  was,  under  this  contract,  extended  to  cover 
reports  of  events  Involving  missile  and  satellite  launchings  and  related  events./  Sub¬ 
section  2.2  presents  the  characteristics  of  the  maximal  unit  of  analysis:  the  EVENT 
REPORT,  while  subsection  2.3  discusses  the  characteristics  of  messages  from  the 
point  of  view  of  their  conceptual  organization  and  that  of  their  linguistic  organization, 
and  gives  details  of  the  analytical  procedures  adopted  for  their  analysis.  Subsection 
2.4  discusses  two  representational  constructs  of  fundamental  importance  in  event 
processing:  the  Template,  and  the  Event  Record.  Subsection  2.6  outlines  some 
issues  involved  In  the  problem  of  reference,  while  subsection  2.6  provides  guidelines 
for  the  establishment  of  a  research  corpus. 

-  ''Section  3  describes  the  Missile  and  Satellite  domains,  and  presents  the  results  of 
their  analysis  in  terms  of  a  domain  definition.  The  discussion  includes  a  characteriza¬ 
tion  of  the  event  report  in  terms  of  its  component  messages;  a  list  of  the  message 
types  encountered  in  the  domains  under  consideration;  a  list  of  the  event  types 
identified  together  with  their  descriptor  system,  and  a  definition  of  the  sublanguage 
in  terms  of  its  vocabulary  and  syntax. 


Section  4  focuses  on  the  implementation  of  OSI’s  message  text  analysis  system, 
MATRES  II.  It  briefly  reviews  the  principles  underlying  OSI's  event  processing  techo- 
nology  and  offers  an  overview  of  MATRES  II.  fjhP  computer  programs  which  embody 
OSI’s  approach  to  the  automated  analysis  of  message  text  are  written  in  FORTH,  Pro¬ 
log,  and  SNOBOL4,  and  run  on  a  PDP  1 1  /45  under  the  RSX  1 1 D  operating  system. 
Finally,  the  analytical  processes  utilized  by  R^ATRES  II  are  illustrated  by  means  of 
examples. 
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EVALUATION 


The  objective  of  this  effort  consists  in  augmenting  and  extending 
the  capabilities  of  the  existing  message  text  processing  methodology 
to  the  subject  domain  of  satellite  and  missile  sightings.  The  work 
described  deals  with  the  analysis  of  textual  reports  of  events  and 
the  synthesis  of  relevant  information  elements  in  a  format  suitable 
for  automated  input  to  the  AIS  database. 

The  effort  is  directed  at  providing  computerized  aids  to  the  I&W 
analyst  in  distilling  the  contents  of  incoming  text  messages  into 
compact,  formatted,  computer  processable  content  representations  in 
support  of  his  mission  to  predict  the  future  on  the  basis  of  information 
describing  past  and  present  events.  The  analyst's  difficulties  result 
from  the  fact  that  the  volume  of  message  traffic  is  normally  very 
high  and  increases  sharply  in  a  crisis  situation.  This  prevents 
efficient  handling  and  full  exploitation  of  the  enormous  amount  of 
variables  contained  in  the  message  traffic  under  both  normal  and 
critical  operating  conditions.  Since  a  computer  experiences  no 
difficulty  in  processing  large  numbers  of  variables,  the  notion  of 
automating  this  task  provides  a  logical  solution  in  the  context  of 
information  explosion. 

The  significance  of  the  subject  effort  consists  in  computer  modeling 
of  the  analyst's  cognitive  activities  in  reading  and  understanding 
message  text,  transforming  its  contents  into  information  items  of 
interest,  and  building  a  conceptual  model  of  the  information  conveyed 
in  the  message.  In  order  to  accomplish  this  task,  the  computer  must 
be  equipped  with  representations  of  both  linguistic  and  extra- 
linguistic  knowledge  inherent  in  cognitive  faculties  of  the  analyst. 

The  approach  to  computer  modeling  of  understanding  relies  heavily 
on  the  recent  and  current  theoretical  advances  in  computational 
linguistics,  language  theory,  artificial  intelligence  and  cognitive 
psychology. 

ZBIGNIEW  L.  PANKOWICZ 
Project  Engineer 
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1.0  INTRODUCTION  AND  SUMMARY 

1.1  Introduction 

This  final  report  presents  the  results  of  work  performed  under  RADC  contract  No. 
F30602-78-C-0274.  The  effort  described  specifically  addressed  the  problem  of  deriv¬ 
ing  indicator  and  descriptor  data  from  the  narrative  text  portions  of  a  class  of  intelli¬ 
gence  messages  dealing  with  events  related  to  missile  and  satellite  launchings  used  for 
Input  to  the  Advanced  Indicator  System  (AIS)  data  base.  The  following  sections  briefly 
discuss  the  intelligence  problem  which  OSI’s  event  processing  technology  intends  to 
solve  (1.1.1.),  and  summarize  the  development  of  the  technology  thus  far  (1.1.2.) 

1.1.1  Problem  Statement.  The  task  of  an  intelligence  analyst  is  to  predict  the  future  on 
the  basis  of  information  describing  what  has  happened  in  the  past  and  what  events  are 
currently  taking  place. 

At  the  global  level,  the  questions  the  analyst  asks  himself  are:  "What  is  happening?" 
"What  does  It  mean  In  terms  of  my  knowledge  about  similar  events  In  the  past?",  "What 
Is  going  to  happen  next"?  He  is  concerned  with  certain  states  of  affairs,  and  events 
signifying  changes  in  these  states  of  affairs. 

When  working  with  a  single  message,  the  analyst  seeks  answers  to  at  last  the  following 
questions: 

1 .  What  is  its  information  content? 

2.  How  reliable  Is  the  source? 

3.  How  "credible"  Is  the  data? 

His  evaluations  of  incoming  information  are  basad  on  his  cognitive  models  of  certain  kinds 
of  situations,  the  personalities,  entities,  and  processes  involved,  and  the  potentialities 
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and  constraints  associated  with  changes  in  an  existing  state  of  affairs. 

Given  the  volume  of  information  he  must  sift,  and  the  complexity  of  the  cognitive  models 
involved,  the  difficulties  of  the  analyst’s  task  are  obvious.  Aids  to  support  his  analytical 
processes  clearly  must  involve  means  for  distilling  the  content  of  incoming  information 
Into  a  form  which  is  compact,  usable,  and  compatible  with  his  view  of  the  world. 

Information  on  the  world  situation  comes  to  the  1&W  analyst  mainly  in  the  form  of  intelli¬ 
gence  messages,  which  are  electrically  received  in  an  l&W  center  24  hours  a  day.  The 
messages  come  from  many  different  originators,  and  are  largely  in  the  form  of  narrative 
text.  The  volume  of  message  traffic  is  extremely  high,  and  in  a  crisis  situation, 
Increases  dramatically.  Even  under  normal  operating  conditions  it  is  very  difficult  for  an 
analyst  to  isolate  items  of  information  from  message  text  and  to  assimilate  and  correlate 
these  items  into  a  pattern  of  events  of  indications  significance.  In  a  crisis  situation,  the 
analyst  is  completely  saturated  with  data,  and  the  performance  of  his  task  demands 
superhuman  capabilities  for  handling  the  enormous  number  of  variables  which  are  con¬ 
tained  In  the  message  traffic. 

A  computer,  on  the  other  hand,  can  process  targe  numbers  of  variables.  Thus,  the  notion 
of  offloading  some  of  the  variable  processing  functions  onto  the  machine  seems  to  pro¬ 
vide  a  logical  solution  to  the  information  problem. 

One  of  the  interesting  developments  in  this  direction  is  the  Advanced  Indications  System 
(AIS),  which  currently  has  the  capability  to  statistically  analyze  intelligence  data  and  to 
display  such  data  in  a  logical  and  useful  form. 

Briefly,  the  AIS  provides  the  following: 

a.  A  structure  for  continuous  objective  and  systematic  monitoring  of  selected  indicator 
time  series. 
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b.  Computer  based  logic  for  detecting  the  significant  patterns  in  current  data,  compar¬ 
ing  them  to  past  activities,  and  quantifying  the  departure  from  normally  observed 
activity. 

c.  The  capability  to  interact  at  a  single  point  with  a  comprehensive  data  base  to 
assimilate,  investigate,  present  and  resolve  the  unusual  situations  detected. 

At  the  current  stage  of  development,  the  messages  to  be  analyzed  by  the  AIS  are  manu¬ 
ally  sorted  into  functional  threat  focused  indications  categories.  Data  elements  called 
’indicators’  and  ’descriptors’,  which  are  n-ary  structures  of  various  types*,  are  then 
derived  from  relevant  segments  of  certain  messages,  and  subsequently  entered  into  the 
system’s  data  base  for  the  daily  update.  Due  to  the  experimental  and  developmental 
status  of  the  AIS,  update  of  the  descriptor/indicator  data  base  is  currently  performed 
manually  by  contractor  personnel.  When  the  system  achieves  operational  status,  update 
of  the  AIS  data  base  will  be  performed  by  an  l&W  analyst.  Despite  the  considerable 
benefits  he  will  derive  from  the  AIS,  maintaining  the  AIS  data  base  constitutes  still 
another  task  for  the  overburdened  analyst. 

1.1.2  Toward  a  Solution.  For  the  past  several  years  RADC  has  been  sponsoring  an 
exploratory  and  developmental  program  related  to  the  design  and  development  of  a  gen¬ 
eral  methodology  for  the  efficient  and  effective  exploitation  of  the  content  of  electri¬ 
cally  transmitted  intelligence  messages.  The  long  term  goal  of  this  work  is  to  develop  a 
system  which  would  assist  the  analyst  in  creating  and  maintaining  formatted  data  bases 
derived  from  natural  language  text,  and  thus  offload  some  of  the  processing  functions 
from  the  analyst  to  the  computer.  Such  a  system  should  provide  the  analyst  with  infor¬ 
mation  which  is  needed  for  the  attainment  of  his  particular  goal,  i.e.,  information  which  is 

*  These  are  described  in  detail  in  a  classified  appendix  to  the  OS!  Final  Technical 
Report  RADC-TR-77-1 94,  June  1977. 
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relevant  to  his  task,  is  of  high  epistemic  standing,  and  therefore  useful  to  solving  his 
problems. 

Figure  1-1  shows  the  components  of  a  total  system  for  message  exploitation  and 
highlights  the  focus  of  the  RADC  program. 

As  mentioned  above,  the  work  described  ih  this  report  is  concerned  with  the  analysis  of 
textual  reports  of  events  and  the  synthesis  of  relevant  information  elements  in  a  format 
suitable  for  automated  input  to  the  AIS  data  base. 

Specifically,  the  program  addresses  the  problem  of  automating  the  analysis  of  the  narra¬ 
tive  text  portions  of  intelligence  messages  describing  events,  with  the  aim  of  transform¬ 
ing  them  into  succinct,  formatted,  computer  processable  content  representations. 

The  automated  generation  of  information  elements  from  narrative  message  text  requires 
that  the  computer  in  some  sense  "understand"  natural  language  text.  Within  the  con¬ 
text  of  the  work  described  here,  we  say  that  a  computer  system  understands  an  input 
text  Insofar  as  it  can  construct  an  adequate  representation  of  the  information  content 
of  that  text.  Specifically,  we  require  that  the  output  of  the  computer  understanding 
process,  when  applied  to  some  message  text,  furnish  the  analyst  with  at  least  those 
Information  elements  that  he  would  himself  have  extracted  from  that  particular  text. 

OSI’s  approach  to  the  problem  of  computer  "understanding"  leans  heavily  on  theoretical 
advances  in  several  disciplines,  including  theoretical  linguistics,  computational  linguistics, 
artificial  intelligence,  text  linguistics,  and  cognitive  psychology.  A  survey  of  the  field  as 
related  to  the  work  reported  here  can  be  found  in  Silva  and  Montgomery  (1978)  and  Silva 
et  al.  (1979). 
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Figure  1-1.  Components  of  an  Experimental  Message  Exploitation  System 


The  aim  Is  to  model  the  cognitive  activities  of  the  human  analyst  as  he  reads  and  under¬ 
stands  message  text,  distilling  its  contents  into  information  items  of  interest  to  him,  and 
building  a  conceptual  model  of  the  information  conveyed  by  the  message. 

In  order  to  model  this  human  cognitive  activity,  the  computer  must  be  equipped  with 
representations  of  both  linguistic  and  extra-linguistic  knowledge,  and  a  means  of  mani¬ 
pulating  such  representations  for  the  analysis  of  text  and  synthesis  of  information  ele¬ 
ments.  The  elements  must  then  be  presented  in  a  clear  and  useful  format  suitable  for 
the  task  at  hand. 

1.2  Summary 

1.2.1  Scope  of  Current  Effort.  The  scope  of  the  effort  described  here  included  extend¬ 
ing  and  augmenting  the  capabilities  of  the  message  text  processing  methodology  origi¬ 
nally  developed  under  previous  contracts  to  the  missile  and  satellite  subject  domain. 

Briefly,  the  work  involved  the  establishment  of  a  research  corpus;  the  development  of  a 
transcription  scheme  for  the  sanitization  of  messages;  extensions  to  the  linguistic 
methodology;  extensions  and  additions  to  the  ATN  grammar  constructed  under  a  previous 
contract  to  accept  a  wider  range  of  linguistic  structures;  the  refinement  of  the  notion  of 
"template"  --  the  fundamental  information  structure  developed  for  the  organization  and 
representation  of  knowledge  about  events;  the  development  of  templates  for  the  missile 
and  satellite  domains;  and  finally,  additions  and  extensions  to  the  existing  algorithms  for 
the  interpretation  of  narrative  text  and  its  subsequent  transformation  into  formal  content 
representations. 

A  major  effort  was  devoted  to  the  development  of  additional  program  modules  to  accom¬ 
modate  new  syntactic  construction  types  in  the  missile  and  satellite  domains,  and  to  the 
provision. of  adequate  system  control. 
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1.2.2  Current  Capabilities  of  OSI's  Message  Text  Processing  System.  The  OSI  message 
text  analysis  system  has  the  capability  to  digest  narrative  text  and  systematically 
transform  it  into  concise,  machine'  processable  content  representations  called  'event 
records’,  In  which  a  message  can  be  viewed  from  several  perspectives:  time,  location, 
organization  involved,  activity  type,  etc. 

Specifically,  the  current  capabilities  of  the  system  are: 

a.  It  determines  the  key  event  described  in  a  message  on  the  basis  of  an  analysis  of 
its  first  sentence  and  presents  it  to  the  analyst  in  a  form  that  answers  the  basic 
"what  is  happening?"  question  (1.1.1.). 

b.  It  provides  information  useful  for  determining  the  reliability  of  the  source  by  recog¬ 
nizing  and  displaying  the  reported  source  of  an  event.  For  example,  if  an  event  is 
reported  by  a  foreign  news  agency,  the  name  of  that  news  agency  is  displayed  in 
the  event  record  under  the  heading  Infosource. 

c.  It  provides  information  helpful  to  evaluate  the  credibility  of  the  source  data,  by 
highlighting  the  probabilistic  information  associated  with  a  report  of  an  event.  Words 
such  as  'possible',  'probable',  'successfully'  constitute  judgments  of  the  originator  of 
a  message  as  to  the  reliability  of  the  data  reported.  Such  words  are  preserved  dur¬ 
ing  processing  and  remain  associated  with  the  term  they  modify  in  the  text.  For  an 
example  see  Table  1-1  below,  where  the  word  'successfi^Iy'  indicates  certitude  of 
the  successful  completion  of  the  deorbit  event  described. 

Table  1-1  shows  a  hypothetical  (partially  transcribed)  input  sentence  describing  a  major 
event  and  the  corresponding  event  record  produced  by  MATRES  II. 
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Table  1-1  Example  Input  and  Output  by  MATRES  II 


THE  NEWSAGENCY  PRESSNAME  ANNOUNCED  THAT  THE  TWO 
UNIDENTIFIED  SATELLITES  WHICH  WERE  LAUNCHED 
FROM  PLACENAME1  ON  09  MARCH  1973  AT  ZULUTIME  WERE 
SUCCESSFULLY  DEORBITED  INTO  THE  PLACENAME2  ON  09  APR  1978 
BY  THE  POLITNAME  ON  REVOLUTION  3NMBR 

Infosource=  THE  NEWSAGENCY  PRESSNAME 
Event:  DEORBIT 

Actions  SUCCESSFULLY  DEORBITED 
Agent=  BY  THE  POLITNAME 
Object:  SATELLITE 

...Equipments  UNIDENTIFIED  SATELLITES 
...Numbers  TWO 

...Relatives  LAUNCHED  FROM  PLACENAME1  ON  09  MARCH  1978  AT 
ZULUTIME 

Locations  INTO  THE  PLACENAME2 
Revolutions  ON  REVOLUTION  3NMBR 
Dates  ON  09  APR  1978 
+ - + 

The  report  is  divided  into  four  major  sections. 


Section  2  offers  a  summary  of  OSI’s  methodological  approach  to  the  analysis  and 
description  of  event  reports.  This  methodology,  initially  developed  on  the  basis  of  mes¬ 
sages  dealing  with  air  activities,  was,  under  this  contract,  extended  to  cover  reports  of 
events  Involving  missile  and  satellite  launchings  and  related  events.  Subsection  2.2 
presents  the  characteristics  of  the  maximal  unit  of  analysis:  the  EVENT  REPORT,  while 
subsection  2.3  discusses  the  characteristics  of  messages  from  the  point  of  view  of  their 
conceptual  organization  and  that  of  their  linguistic  organization,  and  gives  examples  of 
the  analytical  procedures  adopted  for  their  analysis.  Subsection  2.4  discusses  two 
representational  constructs  of  fundamental  importance  in  event  processing:  the  Tem¬ 
plate,  and  the  Event  Record.  Subsection  2.6  outlines  some  issues  involved  in  the  prob¬ 
lem  of  reference,  while  subsection  2.6  provides  guidelines  for  the  establishment  of  a 
research  corpus. 
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Section  3  describes  the  Missile  and  Satellite  domains,  and  presents  the  results  of  their 
analysis  in  terms  of  a  domain  definition.  The  discussion  includes  a  characterization  of 
the  event  report  In  terms  of  its  component  messages;  a  list  of  the  message  types 
encountered  in  the  domains  under  consideration;  a  list  of  the  event  types  identified 
together  with  their  descriptor  system,  and  a  definition  of  the  sublanguage  in  terms  of  its 
vocabulary  and  syntax. 

Section  4  focuses  on  the  implementation  of  OSI’s  message  text  analysis  system, 
MATRES  II.  It  briefly  reviews  the  principles  underlying  OSI’s  event  processing  technol¬ 
ogy  and  offers  an  overview  of  MATRES  II.  The  computer  programs  which  embody  OSI’s 
approach  to  the  automated  analysis  of  message  text  are  written  in  FORTH,  Prolog,  and 
SNOBOL4,  and  run  on  a  PDP  11/45  under  the  RSX  11D  operating  system.  Finally,  the 
analytical  processes  utilized  by  MATRES  II  are  illustrated  by  means  of  examples. 

Section  5  contains  a  list  of  references  to  books,  journal  articles,  conference  papers, 
doctoral  dissertations,  and  other  publications  of  relevance  to  the  work  described  here. 
Much  of  the  information  contained  in  these  publications  has  inspired  and  guided  the 
RADC-sponsored  developmental  work  on  automated  data  base  generation  since  its 
inception  several  years  ago. 

Appendices  A-E  contain  a  listing  of  sanitized  message  text  (A),  a  listing  of  the  combined 
lexicons  for  the  air  activities,  missile,  and  satellite  domains  (B),  a  listing  of  the  ERL  tem¬ 
plates  and  their  ancilliary  procedures  as  encoded  In  Prolog  (C),  a  listing  of  the  FSA  char¬ 
acter  processing  algorithm  (D),  and  a  set  of  examples  of  system  Input/output  (E). 

1.3  Conclusions 

OSI’s  message  text  analysis  methodology,  while  Initially  developed  on  the  basis  of  a  res¬ 
tricted  subject  domain  —  reports  of  air  activities  —  has  successfully  been  applied  to 
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two  new  subject  domains  --  those  of  missile  and  satellite  reports.  It  has  been  demon¬ 
strated  to  be  general  in  Its  applicability  to  different  subject  areas  and  therefore  exten¬ 
sible  in  a  non-trivial  manner  to  reports  of  events  involving  the  physical  movement  of 
objects  such  as  aircraft,  ships,  missiles,  and  satellites. 

It  Is  shown  that  the  automated  analysis  of  event  data  needs  to  take  into  account  all 
aspects  of  event  reporting  and  requires  a  truly  interdisciplinary  approach.  Several  lev¬ 
els  of  analysis  are  identified,  each  involving  a  different  aspect  of  event  reporting,  and 
each  based  upon  different  considerations. 

The  three  sublanguage  domains  studied  thus  far  consist  of  descriptions  of  events  involv¬ 
ing  aircraft  activities  and  launchings  of  missiles  and  satellites,  and  related  events.  In  all 
three  cases,  the  source  data  are  contained  in  the  text  portions  of  military  messages 
typical  of  these  subject  domains,  consisting  of  a  report  title  summarizing  a  given  event, 
followed  by  one  or  more  declarative  sentences  describing  that  event  (and  optionally, 
other  related  events). 

In  all  three  cases,  the  semantics  and  the  syntax  of  event  descriptions  are  constrained 
by  two  factors.  One,  by  the  particular  subject  domain,  and  two,  by  the  fact  that  the 
events  described  are  limited  to  what  is  observable  and  what  should  be  reported  accord¬ 
ing  to  a  reporting  procedure.  This  results  in  a  substantial  number  of  participial  construc¬ 
tions  of  various  types,  complex  nominalizatlons  and  agentless  passives,  as  well  as  a 
range  of  types  of  quantification,  conjunction,  complementation,  ellipsis,  and  anaphora. 

The  sublanguages,  although  less  extensive  In  their  inventory  of  syntactic  constructions 
than  event  reports  in  journalistic  narrative,  nevertheless  contains  certain  constructions 
which  present  challenging  semantic  problems.  Such  problems  include  the  treatment  of 
"respectively"  constructions,  as  well  as  certain  types  of  definite  anaphora  which  not 
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only  transcend  sentence  boundaries  and,  In  some  cases,  even  message  boundaries,  but 
often  are  of  the  kind  that  have  no  explicit  referent  In  the  previous  discourse. 

Of  the  three  languages  studied  thus  far,  the  discourse  structure  of  the  satellite  reports 
Is  considerably  more  complex  than  that  of  missile  reports  or  reports  in  the  air  activities 
domain.  While  in  air  activities  reports  the  description  of  a  given  event  is  often  com¬ 
pleted  within  a  single  sentence  (e.g.,  a  particular  aircraft  penetrated  enemy  airspace  at 
a  specific  location  and  a  specific  time),  in  missile  and  satellite  reports  the  complete 
specification  of  the  properties  of  an  event  and  of  the  object(s)  involved  more  frequently 
requires  several  sentences,  and  not  uncommonly,  several  messages.  Thus,  a  report  on 
some  launch  operation  can  consist  of  an  initial,  rather  skeletal  statement,  followed  by 
one  or  more  messages  received  over  a  period  of  time  which  update  the  previous  report, 
adding  to  and  sometimes  changing  previous  specifications. 

Although  event  reports  in  the  air  activities  domain  may  also  involve  several  messages, 
the  update  problem  is  much  simpler,  since  the  attributes  of  a  flight  event  are  fewer  and 
less  complex  than  those  of  satellite  events.  In  any  case,  the  boundaries  of  a  discourse 
relevant  to  a  single  event  can  range  from  a  single  sentence  to  several  messages.  The 
problem  of  assembling  the  total  mental  "picture"  relating  to  any  given  event  can  only  be 
approached  on  the  discourse  level. 

The  major  impact  of  adding  a  new  subject  domain  to  the  system’s  repertoire  was  felt  in 
two  related  areas:  vocabulary  analysis  and  template  construction.  Although  there  exists 
a  core  of  overlapping  vocabulary  for  the  three  subject  domains,  domain-specific  usage 
of  event-related  terms  necessitates  the  construction  of  separate  templates  for  each 
domain,  even  in  those  cases  where  they  share  the  central  concept. 

To  see  this,  consider  the  FLIGHT  concept,  which  is  shared  by  the  three  domains,  in  the 
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air  activities  domain,  it  happens  to  be  the  key  concept,  and  therefore  it  has  a  complex 
internal  structure.  In  the  missile  and  satellite  domains,  it  is  referred  to  only  tangentially 
to  express  the  duration  of  a  mission:  "THE  MISSILE  IMPACTED  AFTER  A  FIVE  MINUTE 
FLIGHT". 

The  syntax  of  the  three  sublanguages,  on  $he  other  hand,  exhibited  a  large  number  of 
similarities,  so  that  the  addition  of  the  new  domains  only  required  development  of  a  small 
number  of  new  constructions,  specifically,  relative  clause  constructions  and  appositive 
postmodofication,  which  was  relatively  rare  in  the  air  activities  reports  and  had  there¬ 
fore  not  been  sufficiently  developed. 

MATRES  II  is  still  at  an  early  stage  of  development.  No  attempt  has  been  made  to 
represent  a  complete  set  of  semantic  structures  for  the  interpretation  of  the  whole 
range  of  linguistic  expressions  occurring  in  the  messages  studied. 

On  the  contrary,  the  aim  was  to  find  a  small  set  of  structures  that  would  describe  as 
many  event  types  as  possible,  but  still  be  simple  and  compact  enough  to  be  the  basis  of 
a  manageable  and  understandable  computer  system. 

This  goal  has  to  a  great  extent  been  achieved.  By  limiting  the  scope  of  the  system’s 
knowledge,  it  has  been  possible  to  model  a  complex  domain  of  practical  significance,  and 
to  Implement  algorithms  that  "understand"  this  domain  in  a  limited  sense. 

From  the  theoretical  point  of  view,  it  Is  important  to  stress  that  the  development 
described  In  this  report  is  only  a  first  step  toward  a  formal  characterization  of  the  rela¬ 
tionship  between  knowledge-based  language  understanding  and  the  generation  of  Indi¬ 
cator  and  descriptor  data. 

However,  from  the  pragmatic  point  of  view,  the  work  carried  out  under  the  current  con¬ 
tract  has  demonstrated  that  OSI’s  initial  design  concept  was  sound,  and  can  be 
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developed  into  an  automated  support  system  for  l&W  functions.  The  timeframe  of  an 
operational  development  is  obviously  a  function  of  the  degree  of  automated  versus 
interactive  processes  -  -  the  more  interaction,  the  less  is  required  of  the  knowledge 
base. 

The  concepts  underlying  the  MATRES  II  design  and  implementation  appear  sufficiently 
useful  that  the  system  has  already  aroused  considerable  interest  both  within  and  out¬ 
side  the  intelligence  community.  With  some  additional  developmental  effort,  it  should  be 
possible  to  field  an  experimental  MATRES  system  in  the  near  future,  allowing  interested 
users  the  possibility  of  hands-on  evaluation  of  the  l&W  data  base  generation  concept. 
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2.0  METHODOLOGICAL  APPROACH 
2.1  Preliminary  Notions 

The  method  of  approach  which  OSI  has  adopted  since  the  inception  of  the  RADC  explora¬ 
tory  and  developmental  program  for  Automated  Data  Base  Generation  has  been  to  look 
ahead  to  the  potential  capabilities  of  a  future  system  for  both  interactive  and  fully 
automated  exploitation  of  the  narrative  text  of  intelligence  messages,  and  to  develop  a 
methodology  that  will  remain  valid  for  applications  of  considerably  greater  scope  than 
the  one  currently  under  development. 

This  section  offers  a  summary  of  OSI’s  methodological  approach  to  the  automated  gen¬ 
eration  of  indicator  and  descriptor  data  from  the  narrative  text  portions  of  intelligence 
messages  reporting  on  events  related  to  movements  of  physical  objects  such  as  air¬ 
craft,  ships,  missiles  and  satellites. 

This  methodology,  initially  developed  on  the  basis  of  messages  dealing  with  air  activities, 
was,  under  this  contract,  extended  to  cover  reports  of  events  involving  missile  and 
satellite  launchings  and  related  events. 

OSI’s  methodological  approach  is  centered  around  the  notion  of  "event",  which  is 
adopted  as  the  logical  unit  of  analysis,  and  thus  becomes  the  basis  for  describing  intelli¬ 
gence  information.  Although  the  concept  of  an  event  is  fundamental  to  many  research 
endeavors,  no  standardized  terminology  for  describing  or  classifying  events  seems  to 
exist.  In  many  cases,  the  definition  of  an  event  is  arbitrary  and  tailored  to  a  particular 
field  or  purpose.  Thus,  In  physics,  the  term  ’event’  usually  refers  to  a  point  in  the 
space/time  continuum,  while  in  mathematical  statistics  it  has  the  broadest  meaning,  that 
of  any  proposition,  whether  true  or  not.  The  philosopher  Russell  (1956),  regards  the 
event  concept  as  a  primitive  (i.e.,  as  undefined)  and  then  uses  it  to  define  a  series  of 
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time  points.  In  another  usage,  ’event*  refers  to  a  fact.  In  a  related  approach,  the  event 
concept  encompasses  the  parameters  of  ’actlon/time/location,’  and  is  used  to  refer  to 
a  type  of  activity.  For  a  detailed  treatment  of  the  event  concept  as  initially  developed 
by  OSI,  the  reader  is  referred  to  Kuhns  (1974),  or  Silva  et  al.,  (1 979). 

For  the  purpose  of  the  work  described  in  this  report,  we  have  expanded  the  definition  of 
an  event  as  previously  stated,  to  encompass  all  the  states,  processes,  and  activities 
associated  with  an  object  or  a  set  of  objects  from  the  inception  to  the  termination  of 
its/their  mission.  The  term  used  for  the  aggregate  of  these  states,  actions,  activities, 
and  processes  is  "global  event",  symbolized  as  EVENT. 

To  illustrate  this  usage  of  the  term  EVENT,  consider  the  story  of  Skylab,  as  reported  in 
the  news  media  (Los  Angeles  Times,  Newsweek,  and  Aviation  Week  between  1973  and 
1979). 

Briefly,  Skylab  was  the  orbital  workshop  which  was  launched  from  the  Kennedy  Space 
Center  on  the  14th  of  May,  1973.  It  incurred  serious  damage  at  lift-off  time  and  was 
later  repaired  by  the  astronauts  previously  scheduled  to  rendezvous  with  it.  The 
astronauts,  launched  in  a  modified  Apollo  service  module,  attached  themselves  to  Skylab 
by  "umbilical"  cords  (connections  to  life  support  systems),  and  salvaged  the  ailing 
spacecraft.  Subsequently,  Skylab  was  used  for  experiments,  and  was  later  abandoned. 
Recently,  it  caused  much  concern,  because  it  was  obvious  Skylab  was  going  to  deorbit 
and  crash  to  earth.  It  did  indeed  reenter  the  earth’s  atmosphere,  upon  which  it  began 
to  break  up  Into  pieces,  some  of  which  burned  up  on  reentry.  Others  impacted  in  the 
Indian  Ocean,  and  yet  others  landed  in  Western  Australia. 

From  a  global  perspective,  the  Skylab  story  constitutes  an  EVENT  and  all  the  various 
news  items  which  informed  us  of  the  state  of  Skylab  over  the  last  six  years  constitute 
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an  EVENT  REPORT. 


A  global  event  has  a  complex  internal  structure  composed  of  smaller  Interrelated  units: 
the  launch,  the  deorbit,  the  impact,  etc.  These  smaller  units  are  referred  to  as  ’events’, 
or  ’atomic  events’.  An  ’atomic  event’  is  roughly  either  a  property  that  an  object  has  at  a 
point  in  time  or  over  a  time  interval,  or  a  relation  that  holds  among  a  set  of  objects  or 
locations  at  a  point  in  time  or  over  a  time  interval. 

Descriptions  of  atomic  events  take  two  forms:  intensional  descriptions,  and  extensions/ 
descriptions. 

An  intensional  description  is  an  abstract  description  of  a  class  of  Individuals  in  terms  of 
a  set  of  invariant  properties  common  to  all  members  of  the  class.  Thus,  the  intensional 
description  of  the  class  of  launch  events  would  state  that  ail  such  events  are  associ¬ 
ated  with  an  object  that  can  be  launched.  In  the  satellite  domain,  the  object  is  a  satel¬ 
lite.  It  usually  has  some  specified  mission,  and  is  associated  with  an  orbit,  which  itself 
Is  described  in  terms  of  several  parameters  such  as  apogee,  inclination,  perigee,  and 
period.  Other  entities  associated  with  a  satellite  launch  are  the  launch  system  used  for 
Injecting  the  satellite  into  orbit,  the  launch  site,  and  the  time  and  date  of  the  launch. 

An  extensional  description  Involves  one  individual,  l.e.,  a  unique  member  of  a  class  of 
Individuals  in  the  world  being  modeled.  A  simple  example  is  the  description  of  a  specific 
launch  event  involving  a  particular  spacecraft  (e.g.,  Skylab),  launched  from  a  particular 
launch  site  (the  Kennedy  Space  Center),  by  a  particular  launch  system  (a  Saturn-5  type 
launch  vehicle),  into  a  particular  orbit  (the  Skylab  orbit  had  an  inclination  of  50  degrees 
to  the  equator),  at  a  particular  time  and  date  (1300  hours  on  14  May  1973). 

The  representational  construct  for  Intensional  descriptions  of  events  and  their  associ¬ 
ated  concepts  is  the  “template"  (see  subsection  2.4.1),  while  that  for  the  extensional 


description  of  events  and  their  associated  concepts  is  the  "event  record"  (see  subsec¬ 
tion  2.4.2).  The  relation  between  a  template  and  its  corresponding  event  record  is 
roughly  the  same  as  that  which  holds  between  an  intensional  description  of  a  concept 
and  its  extension.  Thus,  the  set  of  event  records  describing  events  of  the  same  class, 
i.e.,  event  records  related  to  a  particular  template,  constitute  the  extension  of  the  con¬ 
cept  described  by  the  template. 

The  collection  of  reports  describing  an  EVENT  constitute  an  EVENT  REPORT.  In  the  mili¬ 
tary  environment,  an  EVENT  REPORT  comprises  a  variable  number  of  discrete  intelligence 
messages,  which  are  often  received  hours,  days,  weeks,  or  even  years  apart. 

The  messages  fall  into  definable  classes  which  have  important  methodological  implica¬ 
tions  for  event  processing.  Each  message  type  has  an  internal  conceptual  organization 
reflected  in  the  linguistic  organization  of  the  message  text.  The  next  few  subsections 
describe  the  major  characteristics  of  EVENT  REPORTS,  and  the  internal  conceptual  and 
linguistic  organization  of  message  text. 

2.2  Characteristics  of  EVENT  REPORTS 

One  of  the  fundamental  properties  of  an  EVENT  REPORT  is  that  it  is  coherent  with 
respect  to  a  global  theme.  Thus,  if  the  central  theme  of  an  EVENT  REPORT  is  the  mission 
of  a  particular  spacecraft,  the  atomic  events  described  by  individual  messages,  including 
the  comments  offered  and  the  inferences  stated,  are  all  in  some  intuitive  way  related  to 
that  theme. 

Precise  rules  for  establishing  coherence  at  this  level,  however,  are  very  difficult  to  for¬ 
mulate.  Although  it  is  clear  that  determining  coherence  involves  domain-specific 
knowledge  of  objects,  their  properties  and  their  behavior  In  the  real  world,  as  well  as 
knowledge  of  conventions  governing  reporting  procedures,  the  inferential  procedures  for 
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establishing  coherence  cannot  be  explicitly  formulated  at  this  time. 

An  attempt  was  made  to  identify  some  of  the  factors  which  might  enter  into  the  estab¬ 
lishment  of  links  between  messages  in  the  missile  and  satellite  domain.  Two  notions 
emerged  as  crucial  in  this  area:  the  time  reference  of  a  reported  event,  and  the 
discourse  referent,  i.e.,  the  entity  discussed.  (For  further  discussion,  see  subsections 

2.3. 4.3  and  2.5). 

It  is  clear  that  the  procedures  involved  In  event  processing  at  the  level  of  the  EVENT 
RECORD  must  be  based  upon  a  cognitive  theory  of  discourse  comprehension  utilizing  real 
world  knowledge  of  how  the  objects,  facts,  processes  and  events  are  organized  in  a 
particular  subject  domain. 

2.3  Characteristics  of  Messages 

As  mentioned  above,  the  individual  messages  in  message  sequences  constituting  EVENT 
REPORTS  usually  fall  into  several  identifiable  classes.  From  the  point  of  view  of 
automated  computer  analysis,  a  distinction  must  be  made  between  those  messages  that 
contain  new  event  descriptions  (i.e.,  descriptions  of  events  reported  for  the  first  time), 
and  those  that  either  confirm  previously  reported  events,  request  changes  in  the  param¬ 
eters  of  some  previously  reported  event,  add  information  to  previously  underspecified 
parameters,  or  provide  summaries  of  global  events.  From  an  operational  point  of  view,  a 
first  report  involves  creating  a  new  data  element,  while  elaborations,  requests  for 
change  and  updates  involve  changes  and/or  additions  to  an  already  existing  structure. 

Messages  have  a  complex  internal  structure  comprising  header  information,  followed  by 
either  formatted,  semi-formatted,  and/or  unformatted  (narrative)  text  portions,  before 
ending  with  some  special  symbols  signalling  the  conclusion  of  the  message. 
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Since  this  work  is  concerned  mainly  with  the  narrative  text  portions  of  messages,  the 
latter  are  described  in  terms  of  three  components:  a  ’pre-text’  component,  the  ’text* 
component,  and  a  'post-text'  component.  The  next  two  subsections  focus  on  the 
characteristics  of  the  ’text’  component  of  messages. 

2.3.1  The  Conceptual  Organization  of  Message  Text.  This  subsection  focuses  on  the 
conceptual  organization  of  message  text.  In  general,  the  ’text’  component  of  a  message 
contains  information  as  to  the  time  and  location  of  a  given  atomic  event,  and  may  contain 
additional  data  giving  the  context  of  the  event  sequence  or  chain  of  related  events, 
properties  of  objects  involved,  the  source  of  the  information,  and  some  interpretation  of 
the  event. 


The  majority  of  event-related  messages  have  a  characteristic  structure  which  may  be 
represented  by  the  following  formula,  where  the  parentheses  enclose  optional  elements, 
and  curly  brackets  enclose  alternatives: 

(S)  E 

S  represents  references  to  the  source  of  the  information,  and  is  an  optional  element. 
The  following  single-sentence  message  contains  a  source  reference  represented  by  the 
String  THE  SOVIET  NEWS  AGENCY  TASS: 


.  THE  SOVIET  NEWS  AGENCY  TASS  ANNOUNCED 
THAT  IMPACT  OF  COSMOS-954  TOOK  PLACE 
NEAR  YELLOWKNIFE,  CANADA. 

E  symbolizes  the  key  event  being  reported  in  the  message,  while  L  represents  the  loca¬ 
tion  and  T  the  time  of  the  given  event.  In  the  above  example,  the  key  event  is  the 
impact  of  Cosmos-954,  and  the  location  is  Yellowknife,  Canada.  The  time  of  the  impact 
is  not  mentioned.  When  time  information  is  omitted,  it  can  be  derived  from  the  header  of 
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is  not  mentioned.  When  time  information  is  omitted,  it  can  be  derived  from  the  header  of 
the  message,  or  from  a  combination  of  other  factors,  including  the  tense  of  the  main 
verb.  The  latter  procedure  often  involves  complex  inferential  processes  which  are  dif¬ 


ficult  to  implement. 

E’  symbolizes  further  information  on  the  key  event,  e.g.,  properties  of  the  objects 
involved,  and  other  occurrences  in  the  chain  of  events  reported. 

I  represents  further  information  on  the  event.  The  latter  often  takes  the  form  of  evalua¬ 
tive  comments  on  surrounding  circumstances  or  consists  of  sentences  describing 
relevant  historical  background.  Interpretative  comments  are  often  absent  from  mes¬ 
sages,  but  in  those  cases,  more  detailed  information  on  the  key  event  itself  is  usually 
given. 

Messages  containing  more  than  one  paragraph  are  generally  structured  along  the  same 
lines,  where  each  additional  paragraph  reports  a  related  event,  with  associated  time  and 
location  data,  details,  and  Interpretive  comments. 

In  event-related  messages,  a  description  of  the  key  event  and  Its  parameters  occurs  in 
the  first  sentence  of  the  text.  Thus,  It  is  always  the  first  sentence  of  the  message 
text  which  introduces  the  TOPIC  of  the  message. 

2.3.2  The  Description  of  Message  Content.  In  order  to  describe  the  information  content 
of  message  text,  we  utilize  the  "Text  Grammar"  approach  as  developed  in  the  writings 
of  Petofi  (1971),  Petofl  and  Rieser  (1973),  van  Dijk  (1972,  1979),  van  Dijk  and  Petofi 
(1977),  and  Dressier  (1978a,  1978b). 

As  the  name  implies,  the  unit  of  analysis  of  a  text  grammar  is  a  text,  in  our  case,  the 
message  text.  The  aim  of  a  text  grammar  is  to  provide  an  abstract  linguistic  description 


utilizing  as  many  levels  of  analysis  as  necessary  to  make  explicit  the  content  structure 
of  the  text. 

Our  approach  to  this  problem  is  to  describe  the  meaning  content  of  the  message  in  terms 
Of  a  "Message  Grammar"  in  which  the  "primitives"  are  event  classes,  and  the  relations 
are  text-level  relations.  The  latter  may  be  optional  or  obligatory  and  determine  the  con¬ 
nectivity  or  non-connectivity  between  events. 

The  definition  of  the  Message  Grammar  is  based  upon  a  systematic  study  of  the 
language  used  in  a  particular  subject  domain.  The  results  of  the  study  for  the  missile 
and  satellite  sublanguage  are  pesented  in  Section  3  of  this  report. 

The  formulation  of  a  message  grammar  for  event  data  requires  a  multi-level  approach, 
beginning  with  the  description  of  the  text  at  the  level  of  words  and  sentences  and  con¬ 
cluding  with  a  description  of  the  information  content  of  the  message  in  terms  of  higher- 
level  abstract  units.  Each  level  of  description  involves  its  own  units  and  principles  of 
analysis. 

2.3.3  The  Linguistic  Organization  of  Message  Text.  At  the  linguistic  level  a  message 
text  Is  characterized  in  terms  of  a  sequence  of  propositions.  It  is  important  to  note, 
however,  that  a  message  text  is  not  just  an  arbitrary,  unrelated  set  of  propositions;  it  is 
a  coherent,  structured  conceptual  unit,  whose  individual  propositions  are  linked  by  means 
of  text-level  relations  including  coreference,  temporal  relations,  causal  relations,  entail- 
ments  and  presuppositions. 

The  formulation  of  conditions  for  textual  coherence  is  an  essential  part  of  the  linguistic 
characterization  of  a  message  text  and  plays  an  important  role  In  message  text 
analysis.  Some  of  these  relations  are  explicitly  expressed  in  the  surface  structure  of 
the  text;  others  are  inferred  during  the  Interpretation  process  on  the  basis  of  contex- 
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tual  and  real  world  knowledge.  In  addition,  It  Is  Important  to  note  that  for  a  text  to  be 
coherent,  Its  propositions  need  not  necessarily  be  connected  In  an  explicit  manner  at 
the  text  level,  since  much  of  the  information  conveyed  by  a  text  is  implied,  it  is  suffi¬ 
cient  that  a  text  be  coherent  at  a  higher  level  of  abstraction  with  respect  to  the  TOPIC 
of  the  message,  i.e.,  the  entity  discussed. 

The  next  higher  level  of  analysis  is  the  Event  Level.  The  linguistic  level  is  related  to  the 
Event  level  by  a  set  of  mapping  rules  which  transform  the  individual  propositions  of  the 
text  into  event  representations.  The  mapping  rules  are  based  upon  a  study  of  the 
correspondences  between  form  and  content  in  a  given  domain.  Examples  of  such 
correspondences  taken  from  the  domains  under  consideration  are  given  in  the  next  sec¬ 
tion. 

2.3.4  Description  at  the  Event  Level.  This  level  involves  the  description  of  atomic 
events  in  terms  of  their  properties,  including  time,  location,  action,  objects,  and  related 
facts.  For  each  subject  domain,  the  description  of  message  content  at  this  level 
requires  that  the  set  of  characteristic  event  classes  be  identified  together  with  the  set 
of  operations  and  relations  on  events  that  hold  within  the  domain.  This,  in  turn,  requires 
a  thorough  analysis  of  the  vocabulary  of  the  sublanguage,  and  lexical  analysis,  and  a 
study  of  text-level  relations.  The  following  sections  sketch  OSI’s  approach  to  these 
requirements. 

2.3.4.1  Vocabulary  Analysis.  A  KWIC  concordance  of  the  message  sentences  is  first 
prepared  to  serve  as  a  basic  analytical  tool  for  development  of  the  dictionary  and  gram¬ 
mar  (see  subsection  2.6.3).  The  vocabulary  yielded  by  the  initial  KWIC  index,  is  parti¬ 
tioned  into  those  items  which  are  entered  into  the  system’s  dictionary  and  those  that 
are  members  of  open  sets  and  must  be  treated  heuristically. 

#  A  dictionary,  or  lexicon,  Is  a  file  of  Information  about  the  words  of  the  language. 
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The  first  category  is  made  up  of  natural  language  words  (nouns,  pronouns,  adjectives, 
verbs,  adverbs,  conjunctions,  subjunctions,  etc.).  The  second  category  comprises 
number  strings,  date  time  groups,  regiment  numbers,  division  numbers,  and  other 
alphanumeric  strings  such  as  designations  of  equipment  types  and  launch  systems. 
Furthermore,  this  category  also  includes  geographic  designations,  names  of  countries, 
areas,  zones,  cities,  rivers,  etc.,  which  require  the  use  of  special  dictionaries  for  their 
recognition. 

Heuristics  for  automatically  recognizing  and  labelling  lexical  items  which  are  members  of 
open  sets  (e.g„  types  of  equipment,  place  names,  geographical  coordinates)  involves 
writing  specific  recognition  routines  involving  anything  from  pattern-matching  to  syntac¬ 
tic  prediction. 

In  addition  to  a  common  core  of  English  language  words  utilized  across  subject  domains, 
each  individual  subject  domain  requires  Its  special  dictionary  reflecting  domain-specific 
nomenclature  and  usage. 

Since  the  lexicon  has  to  mesh  both  with  the  parser  and  the  Interpretive  routines,  the 
task  of  building  the  lexicon  is  a  continuous  effort  extending  over  the  entire  period  of 
development. 

2.3 .4.2  Lexical  Analysis.  The  results  of  the  vocabulary  analysis  form  the  basis  of  the 
lexical  analysis,  which  in  turn  underlies  the  definition  of  new  templates. 

The  goal  of  lexical  analysis  is  the  description  of  the  concepts  expressed  by  words  and 

the  identification  of  their  conceptual  relations  to  other  words*.  All  key  concepts,  their 

Each  word  entry  is  accompanied  by  information  regarding  Its  syntactic  category,  in 
what  syntactic  environments  it  can  occur,  and  what  some  of  its  semantic  properties 

are. 

*  Note  that  the  term  "word"  is  used  loosely  to  refer  to  single  words  or  combinations  of 
words  generally  considered  to  form  a  unit  of  meaning. 
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attributes,  their  internal  argument  structure,  and  their  external  relations  to  other  con¬ 
cepts  must  be  determined  for  each  subject  domain.  The  following  paragraphs  provide 
some  background  on  OSI’s  approach  to  lexical  analysis. 

Information  on  words  is  characteristically  acquired  by  experience.  It  is  generally 

accepted  that  such  information  includes  spelling,  pronunciation,  inflected  and  derived 

> 

forms,  major  syntactic  category,  knowledge  about  how  words  combine  with  other  words 
to  form  grammatical  phrases  and  sentences,  knowledge  about  semantic  relations 
between  words  (e.g.,  synonymy,  antonymy,  hyponymy),  knowledge  about  semantic  fields, 
rules  for  appropriate  use  in  a  given  situation,  and  other  encyclopedic  facts.  Knowledge 
about  words  underlies  a  person’s  faculty  to  produce  and  comprehend  language  and  to 
communicate  with  other  people  in  a  manner  appropriate  to  a  given  situation.  The  mean¬ 
ings  of  many  words  depend  upon  functional  and  perceptual  attributes,  as  well  as  the 
place  that  word  occupies  within  a  system  of  concepts. 

One  of  the  goals  of  lexical  analysis  is  to  define  the  boundaries  of  the  lexical  universe 
for  the  domain  under  study.  Within  a  given  subject  domain,  representations  of  concepts 
expressed  by  words  need  only  include  what  is  relevant  to  the  particular  domain. 

To  Illustrate  the  kinds  of  issues  involved  in  formulating  hypotheses  about  concepts 
expressed  by  words,  their  internal  relations  and  their  links  to  other  concepts,  we  offer 
the  following  two  examples. 

2. 3. 4.2.1  Example  1;  Hyponymy.  Hyponymy  is  generally  considered  to  be  one  of  the 
most  important  principles  underlying  the  organization  of  nominal  concepts.  It  has  been 
the  subject  of  many  linguistic  Investigations  —  especially  in  connection  with  kin  terms, 
color  terms,  and  plant  and  animal  taxonomies. 

According  to  Lyons  (1Q68),  hyponymy  is  the  relation  which  holds  between  a  more 
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specific,  or  subordinate,  lexical  unit  and  a  more  general,  or  superordinate,  lexical  unit,  as 
exemplified  by  the  pair  of  words  ’Skylab’  and  ’spacecraft',  where  ’Skylab’  is  a  hyponym 
of  ’spacecraft’.  Under  this  definition.  Pioneer  11,  Voyager,  and  Viking  are  co-hyponyms 
of  the  superordinate  term  ’spacecraft’.  Hyponymy  has  sometimes  been  defined  in  terms 
of  the  logical  relation  of  class-inclusion  (see,  for  example,  Carnap,  1 956).  According  to 
Lyons  (1977),  however,  there  are  problems  attaching  to  the  definition  of  hyponymy  in 
terms  of  the  logic  of  classes.  He  proposes  that  hyponymy  be  defined  in  terms  of  unila¬ 
teral  implication,  as  follows: 

Word  Wi  is  a  hyponym  of  word  Wj  if,  for  any  x,  the  sentence 
”x  is  a  Wi"  entails  the  sentence  "X  Is  a  Wj". 

The  relation  of  hyponymy  imposes  a  hierarchical  structure  upon  a  vocabulary  and  upon 
particular  fields  within  a  vocabulary.  Knowledge  of  hyponymy  relations  is  essential  to 
the  resolution  of  discourse  reference,  and  plays  an  important  role  in  information  retrieval. 
In  Library  Science,  it  underlies  the  assignment  of  “See  also"  references. 

Hyponymy  Is  an  external  relation  between  concepts.  Example  2  below  illustrates  a 
class  of  internal  relations  which  are  often  referred  to  as  "case"  relations  (Fillmore, 
1968). 

2.3.4.2.B  Example  2:  Case  Relations.  Natural  4anguage  cases  are  widely  recognized  as 
an  important  organizing  principle  in.  the  analysis  and  description  of  natural  language 
data.  Many  of  the  advanced  projects  for  natural  language  understanding  in  the  U.S.  as 
well  as  abroad  embody  some  sort  of  case  system. 

A  "case"  is  a  binary  relation  which  holds  between  the  predicate  (usually,  but  not  neces¬ 
sarily,  realized  as  a  verb)  and  one  of  Its  arguments.  A  case  analysis  determines  the 
semantic  roles  of  the  components  of  an  expression  with  respect  to  a  central  concept. 
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Consider,  for  example,  the  sentence: 

A  SATURN-6  TYPE  LAUNCHER  WAS  USED  TO  PLACE  THE 
SPACECRAFT  INTO  ORBIT. 

The  main  predicative  concept  "USE"  denotes  an  action,  and  as  such  determines  the 
roles  of  the  other  components  of  the  sentence  with  respect  to  it.  Thus,  “A  Saturn-6 
type  launcher"  fulfills  the  role  of  "instrument",  and  "to  place  the  spacecarft  into  orbit" 
the  role  of  "purpose"  in  relation  to  "USE".  These  roles  are  purely  semantic  and 
describe  the  internal  relations  of  the  concept  USE  to  other  constructs  in  its  immediate, 
syntactic  environment.  Continuing  our  analysis,  we  note  that  the  "purpose"  argument 
can  be  further  decomposed  into  the  predicative  concept  "place",  the  noun  phrase  "the 
spacecraft",  and  the  prepositional  phrase  "into  orbit".  The  noun  phrase  "the  space¬ 
craft"  fulfills  the  role  of  "object",  while  the  prepositional  phrase  "into  orbit"  fulfills  the 
role  of  "locational  goal"  in  relation  to  "place". 

Correlations  between  the  text  of  our  example  and  its  logical  argument  structure  are 
expressed  by  what  linguists  refer  to  as  the  "selectional  restrictions"  or  the  "selectional 
preferences"  of  a  particular  verb.  Some  of  the  selectional  preferences  for  the  verb 
"use”  are  given  below: 

(a)  The  Agent,  if  expressed  in  surface  structure,  corresponds  to 
the  "logical"  subject. 

(b)  The  Instrument  is  expressed  by  the  object  noun  phrase  denoting 
any  object  which  can  be  "used". 

(c)  The  Purpose,  if  expressed  in  surface  structure,  is  a  "to" 
complement,  or  a  "for"  complement. 

Internal  relations  may  be  obligatory  or  optional.  For  example,  with  the  predicate  USE,  the 


purpose  for  which  something  is  used  need  not  always  be  explicitly  expressed  in  surface 
structure.  The  verbal  gerund  "Using  Message  Spooling  Processors  in  a  Non-interactive 
Network"  does  not  spell  out  the  purpose.  It  does,  however,  give  an  indication  of  the 
environment  ("in  a  non-interactive  network").  Both  Purpose  and  Environment  are  optional 
arguments  of  USE.  Mention  of  the  Instrument  argument,  is,  however,  mandatory. 

In  conclusion,  this  brief  analysis  of  the  two  examples  given  above  identifies  only  a  small 
number  of  the  semantic  relations  which  can  hold  among  words.  An  in-depth  lexical  study 
must  come  to  grips  with  issues  relating  to  polysemy  and  lexical  disambiguation,  including 
the  problem  of  core  senses  and  the  way  a  core  sense  can  be  extended  to  provide  other 
senses;  with  morphological  relations,  including  inflectional  and  derivational  word  forma¬ 
tion,  as  well  as  with  the  meaning  of  compounds,  with  the  notion  of  "semantic  field",  and 
with  issues  of  representation.  In  addition,  the  role  of  presupposition  must  be  clarified. 

Presuppositions  and  entailments  are  of  particular  importance  for  event  data  analysis, 
because  they  make  predictions  about  possible  sequences  of  events.  If  any  of  these 
predictions  are  violated,  they  must  be  brought  to  the  attention  of  the  analyst. 

Z.3.4.3  The  Study  of  Text  Level  Relations.  By  "text-level  relations"  we  mean  those  rela¬ 
tions  that  connect  atomic  propositions  in  a  running  text.  They  include  such  relations  as 
synonymy,  hyponymy,  part/whole  relations,  causal  relations,  temporal  relations,  pronomi¬ 
nal  reference,  noun  phrase  reference,  and  temporal  reference. 

Factors  that  enter  into  text  level  relations  are  implications  in  the  semantic  content  of 
constituent  propositions,  lexical  equivalence,  and  syntactic  devices  such  as  time  and 
place  relators,  logical  connectors,  and  the  use  of  proforms. 

For  example,  consider  the  following  message  fragment,  which  consists  of  two  clauses, 
each  describing  an  atomic  event: 
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THE  SATELLITE  DEORBITED  AFTER  COMPLETING  A  28  DAY  MISSION. 

The  event  described  by  the  main  clause  (the  satellite  deorbited),  stands  in  the  time 
relation  of  succession  to  the  event  described  by  the  subordinate  clause:  the  deorbit  of 
the  satellite  is  understood  to  have  taken  place  after  a  mission  which  lasted  for  28  days. 

This  time  relation  is  explicitly  stated  in  the  text,  and  can  be  derived  from  the  semantics 
of  the  two  clauses,  including  the  meaning  of  the  subordinate  conjunction  ’after’.  The 
semantic  interpretation  of  the  time  relation  between  the  two  propositions  is  defined  as 
follows: 

BEFORE(P.P’) 

where  P  represents  the  28  day  mission,  and  P’  the  deorbit  of  the  satellite. 

In  other  cases,  links  between  propositions  of  a  text  can  be  established  by  means  of  the 
hyponymy  relation  holding  between  words  contained  in  the  text.  To  illustrate  this,  con¬ 
sider  the  following  pair  of  consecutive  sentences: 

(a)  SPUTNIK  1  WAS  OBSERVED  IN  ORBIT 
OVER  THE  INDIAN  OCEAN  ON  FEBRUARY  1 1 . 

(b)  THE  SATELLITE  LATER  REENTERED  THE  EARTH’S  ATMOSPHERE 
AND  BROKE  UP  UPON  REENTRY. 

Here  the  noun  phrase  "the  satellite"  is  a  more  general  term  for  "Sputnik  1",  the  relation 
between  the  two  terms  being  one  of  hyponymy  (subsection  2. 3.4.2).  The  link  between 
the  two  sentences  Is  one  of  noun  phrase  reference. 

Equivalence  between  two  noun  phrases  is  sometimes  explicitly  stated  in  a  text  as  in  (c) 

below: 

(c)  THE  FIRST  EARTH  SATELLITE,  ALSO  KNOWN  AS  SPUTNIK  1 ... 
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In  the  previous  example,  the  information  which  permitts  the  resolution  of  the  noun  phrase 
reference  is  present  in  the  immediate  context.  At  times  lexical  connection  between 
sentences  may  not  be  given  overtly,  but  may  depend  upon  factual  knowledge  that  the 
originator  of  the  text  assumes  on  the  part  of  the  reader.  Consider  the  following  pair  of 
sentences  where  the  definite  noun  phrase  refers  to  a  concept  not  mentioned  in  the  pre¬ 
vious  context,  but  is  conceptually  linked  to  it. 

(d)  A  FLIGHT  OF  TWO  AIRCRAFT  WAS  ACTIVE  OVER 
PLACENAME  DURING  THE  LATE  ZULUTIME  HOURS. 

(e)  THE  PILOTS  WERE  IN  COMMUNICATION  WITH  AN 
UNIDENTIFIED  CONTROL  AND  REPORTING  CENTER. 

The  problem  here  is  to  establish  the  referents  of  the  definite  noun  phrase  "the  pilots", 
which  has  not  been  mentioned  in  the  previous  context.  In  cases  such  as  these,  the 
disambiguation  may  be  handled  by  what  Chafe  (1972)  describes  as  "foregrounding". 
Chafe  argues  that  whenever  a  new  concept  is  introduced  into  a  communication,  that 
concept  introduces  a  number  of  related  concepts  into  the  local  context  or  foreground. 
For  example,  once  the  concept  of  "aircraft"  has  been  mentioned  In  a  text,  we  can  use 
definite  reference  for  its  engine,  fuselage,  wings,  and  even  its  pilot.  Thus,  in  the  pair  of 
sentences  above,  sentence  (d)  introduces  the  concept  of  "aircraft"  which  then 
automatically  brings  into  focus  closely  associated  concepts  including  the  no'  on  of 
"pilot",  which  is  referred  to  by  a  definite  noun  phrase  in  (e). 

In  an  automated  system,  the  fact  that  aircraft  have  pilots  is  stored  in  the  permanent 
knowledge  base  of  the  system  and  is  thus  readily  accessible  to  the  routines  for  ana¬ 
phora  resolution. 
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2.4  Representational  Issues 


This  section  discusses  two  representational  constructs  of  fundamental  importance  to 
event  processing:  the  Template  and  the  Event  Record. 

2.4.1  The  Template.  Taking  the  event  as  the  primary  unit  of  analysis,  OSI  has  developed 
the  concept  of  a  "template"  as  an  organizing  principle  for  the  uniform  representation  of 
information  on  events  and  event-related  entities,  as  viewed  from  the  perspective  of  the 
user/analyst  in  the  context  of  a  particular  task  domain. 

Events  and  event-related  entities  are  described  as  n-ary  relations,  where  the  n-ary 
relationships  is  named  by  a  predicate  symbol  and  the  arguments  of  the  relation 
correspond  to  the  "roles"  of  case  theory  (see  subsection  2.3.4.2). 

An  important  part  of  the  information  encoded  in  templates  consists  of  what  linguists  call 
the  "selectional  restrictions"  or  "setectional  preferences”  in  a  particular  domain.  In 
essence,  the  latter  express  the  correspondences  between  syntax  and  semantics,  and 
play  a  fundamental  role  in  semantic  interpretation.  They  form  the  basis  for  constructing 
the  procedures  which  map  syntactically  analyzed  input  sentences  into  event  records 
and  thus  have  the  function  of  reducing  the  many  ways  a  concept  can  be  expressed  in 
natural  language  to  a  systematic  representation  of  the  information  content  of  the  input 
sentence. 

Although  most  correspondences  between  syntax  and  semantics  hold  across  subject 
domains,  each  domain  seems  to  present  some  idiosyncratic  usage,  it  is  important,  there¬ 
fore,  that  the  correspondences  between  syntax  and  semantics  be  studied  in  depth  for 
each  subject  domain  before  they  can  be  translated  into  algorithms  and  incorporated  into 
templates. 
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In  the  domains  under  investigation  there  are  templates  for  classes  of  objects  (aircraft, 
missiles),  classes  of  events  (flights,  launchings),  classes  of  relations  (temporal,  causal), 
and  other  concepts  such  as  the  date  time  group.  For  example,  a  template  describing  the 
class  of  flight  events  in  the  context  of  the  air  activities  domain  includes  parameters 
such  as  time  of  flight,  the  aircraft  involved  in  the  flight,  the  purpose  of  the  flight  (its 
mission)  ,  the  point  of  departure,  the  destination,  the  current  location,  the  farthest  point 
reached,  direction,  altitude,  and  path.  The  aircraft  involved  in  a  flight  are  in  turn 
described  in  terms  of  aircraft-related  parameters  such  as  equipment  type,  nationality, 
organizational  subordination,  and  so  on.  These  are  all  parameters  which  can  enter  into  a 
flight  event  and  are  therefore  part  of  the  cognitive  model  of  the  analyst,  i.e.,  of  his  view 
of  what  a  flight  involves. 

Table  2-a  provides  an  informal  description  of  the  LAUNCH  template  as  developed  for  the 
Missile  and  Satellite  domains.  The  template  embodies  a  set  of  descriptors  selected  from 
a  small  set  of  descriptor  types.  Each  descriptor  has  a  procedure  attached  to  it  which 
incorporates  the  information  necessary  to  relate  abstract  descriptions  of  concepts  to 
syntactic  structures.  They  are  essentially  mapping  rules  which  effect  the  transforma¬ 
tion  of  parsed  sentences  into  event  records, 

In  a  sense  a  template  is  like  a  class  declaration,  a  structure  naming  the  attributes 
(descriptor  slots)  which  are  optionally  or  obligatorily  associated  with  instances  of  that 
class,  and  specifying  the  values  those  attributes  can  have. 

Templates  integrate  procedural  knowledge  with  a  richly  structured  declarative  represen¬ 
tation.  In  this  sense,  templates  have  a  lot  in  common  with  both  Wilks’(1977)  recursive 
semantic  formulas  and  Bobrow  and  Winograd’s  (1977)  KRL  Specialization  Units. 
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Table  2-a.  Informal  Description  of  the  LAUNCH  Concept 
In  the  Missile  and  Satellite  Domain. 
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Table  2-a  (contd) 
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Each  subject  domain  yields  its  own  template  inventory  corresponding  to  the  events  and 
objects  and  their  internal  and  external  relations  —  optional  or  obligatory  --  which  have 
informational  significance  within  that  domain.  In  the  Missile  and  Satellite  domain,  some  of 
the  key  concepts  for  which  templates  have  been  constructed  are:  "launch",  "deorbit", 
"reentry",  "breakup",  "impact",  "missile",  "satellite",  and  "DTG".  (For  a  complete  listing 
of  the  template  inventory  and  the  auxiliary  procedures  as  encoded  in  the  language  Pro¬ 
log,  see  Appendix  C). 

In  summary,  the  template  is  an  information  structure  which  provides  the  means  for  coding 
the  analyst’s  cognitive  models  in  terms  of  logical  data  structures  which  are  susceptible 
to  automatic  processing.  In  other  words,  templates  provide  a  framework  for  the 
representation  of  higher-level  conceptual  information  approximating  that  which  a  human 
reader  has  of  a  given  subject  matter. 

Logically,  templates  can  be  viewed  as  relational  network  models  of  memory  in  which 
primitives  are  relations,  words,  and  word  senses.  Procedurally,  they  can  best  be 
described  as  the  fundamental  knowledge  structures  which  mediate  the  correlations 
between  syntactic  structures  and  their  corresponding  information  content. 
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Each  template  describes  «  class  of  entities  in  terms  of  those  properties  which  are  nor¬ 
mally  associated  with  that  class  in  a  particular  task  domain.  A  template  thus  reflects 
the  Information  user’s  conceptualization  of  the  domain,  i.e.,  his  view  of  what  that  class  of 
entities  involves. 

2.4.2  The  Event  Record.  While  templates  are  abstract  data  structures  for  the  represen¬ 
tation  of  event  classes,  Event  Records  are  concrete  data  structures  for  the  representa¬ 
tion  of  individual  events.  An  event  record  is  the  description  of  a  single  individual,  i.e.,  a 
unique  member  of  a  class  of  individuals  in  the  world  being  modeled.  A  simple  example  is 
the  description  of  a  specific  spacecraft  (e.g.,  Skylab)  which  was  launched  from  a 
specific  launch  site,  at  a  given  time  and  date.  Table  2-b  illustrates  the  event  record 
corresponding  to  the  LAUNCH  sentence  below: 


THE  SKYLAB  ORBITAL  WORKSHOP,  A  CONVERTED  S-4B 
THIRD  STAGE  FROM  A  SATURN-5  LAUNCH  VEHICLE, 
WAS  LAUNCHED  FROM  THE  KENNEDY  SPACE  CENTER 
AT  1300  HOURS  ON  14  MAY  1973. 


Table  2-b.  Event  Record  for  LAUNCH  Sentence 


Event:  LAUNCH 
Action:  LAUNCHED 
Object:  SATELLITE 

...  Equipments  SKYLAB  ORBITAL  WORKSHOP 
...  Numbers 

...  Relatives  A  CONVERTED  S-4B  THIRD  STAGE  FROM  A 
SATURN- 5  LAUNCH  VEHICLE 
Launchsite=  FROM  THE  KENNEDY  SPACE  CENTER 
Time  =  AT  1300  HOURS 
Date  =  ON  14  MAY  1973 


Notice  that  the  above  sentence  does  not  provide  values  for  all  the  descriptors  associ¬ 
ated  with  the  LAUNCH  template.  Templates  represent  an  aggregate  of  all  the  possible 
parameters  and  attributes  which  can  be  associated  with  an  event  or  object  in  the  con- 


2-21 


text  of  a  given  domain.  Many  of  these  parameters  or  attributes  are  optional  and  there¬ 
fore  need  not  be  present  In  any  one  actual  description  In  a  text. 

Note  also  that  the  appositive  "a  converted  S-4B  third  stage  from  a  Saturn-5  launch 
vehicle"  is  analyzed  as  a  relative  clause  and  stored  in  the  "Relative"  descriptor  slot. 
For  a  discussion  of  the  current  treatment  of  relative  clauses  the  reader  is  referred  to 
the  subsections  on  relative  clauses  and  appositive  postmodification  in  Section  3  of  this 
report. 

Event  Records  have  several  important  properties  which  render  them  particularly  useful 
as  a  support  tool  for  the  l&W  analyst: 

•  They  reflect  the  analysts  view  of  the  world,  and  are  thus  compatible  with  their 
cognitive  models  of  objects,  events  and  states  of  affairs  in  their  area  of  exper¬ 
tise. 

•  They  are  discrete  representations  of  events,  objects,  and  their  properties  and 
are  usable  for  the  construction  of  a  data  base. 

•  They  are  so  designed  as  to  allow  flexible  retrieval  of  information  not  only  by 
event  type,  but  also  by  other  associated  parameters,  such  as  object(s)  involved 
in  the  event,  and  time  and  location  indicators. 

•  The  information  stored  in  these  data  structures  is  in  a  format  which  lends  itself 
readily  to  further  processing.  This  processing  may  be  related  to  storage  and 
retrieval  functions,  may  be  statistical  in  nature,  or  may  be  part  of  the  inference 
making  mechanisms  to  be  developed  for  a  future  system  (e.g.,  a  system  for  event 
prediction). 

The  next  section  deals  primarily  with  discourse  reference.  The  section  begins  by  briefly 
defining  the  problem  of  reference,  and  then  goes  on  to  discuss  several  forms  of 
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reference  common  In  our  sublanguage. 

2.5  The  Problem  of  Reference 

A  more  complete  computer  ’understanding’  of  narrative  text  requires  the  machine  to 
have  the  ability  to  deal  with  anaphoric  language  In  a  perspicuous  and  systematic  way. 
This  ability  is  especially  important  for  the  analysis  of  larger  texts  characteristic  of  intel¬ 
ligence  messages. 

Two  of  the  major  surveys  discussing  the  problems  of  anaphora  are  Nash-Webber  (1077) 
and  Hirst  (1979). 

Computer-based  attempts  to  handle  anaphoric  expressions  are  described  in  Baranovsky 
(1970),  Burton  (1976),  Charnlak  (1972,  1973),  Deutsch  (1975),  Hobbs  (1976), 
Rumelhart  (1976),  Wilks  (1975),  Wlnograd  (1972),  Woods  (1972),  and  others. 

information  contained  in  a  message  falls  into  two  distinct  categories.  At  each  point  in  a 
message  text  some  of  the  information  is  "new":  i.e.,  it  is  being  introduced  into  the  text 
by  the  sentence  being  analyzed  at  that  point.  Other  information  is  "given",  or  "old",  i.e., 
it  has  previously  been  introduced  in  the  text  and  we  assume,  stored  in  memory,  much  as 
the  human  processes  sequential  text.  Interpreting  a  text  requires  identifying  given  con¬ 
cepts  In  memory  and  attaching  the  new  information  to  them.  In  the  following  discussion, 
the  term  "referring  expression"  is  used  to  denote  those  parts  of  a  sentence  that  com¬ 
municate  given  information. 

The  problem  of  reference,  then,  Is  the  problem  of  Identifying  the  concepts  referred  to  in 
a  text.  Such  concepts  need  not  explicitly  be  expressed  as  segments  in  a  text;  they 
very  often  are  entities  which  are  assumed  to  be  in  the  reader’s  mind.* 

*  For  a  discussion  of  the  general  problem  of  reference  see  RADC-TR-77-194,  Vol.  I, 
Part  I,  Section  2.1 . 


Anaphoric  expressions  comprise  pronouns,  pro-verbs,  some  definite  noun  phrases  and 
ellipses.  The  ensuing  remarks  are  restricted  to  definite  referring  expressions. 

2.5.1  Definite  Referring  Expressions.  For  the  purposes  of  discussion  it  is  useful  to  dis¬ 
tinguish  two  kinds  of  definite  referring  expressions:  pronouns  and  non-pronominal  defin¬ 
ite  noun  phrases.  The  reason  for  this  distinction  is  that  the  processes  needed  to  iden¬ 
tify  the  concepts  referred  to  by  pronouns  differ  from  those  needed  for  the  resolution  of 
non-pronominal  definite  noun  phrases. 

The  test  data  used  under  this  contract  contains  both  pronominal  and  non-pronominal 
definite  noun  phrases  (e.g.,  it,  their,  they,  the  spacecraft,  the  three  missiles  launched 
from  PLACENAME). 

2.5.1. 1  Pronominal  Reference.  Pronouns  carry  little  information  in  themselves.  Consider 
the  following  sentence: 

IT  IMPACTED  IN  THE  AUSTRALIAN  OUTBACK. 

The  pronoun  "it"  in  the  above  sentence  only  tells  us  that  the  subject  of  "impacted" 
must  be  singular.  To  help  identify  the  referent  of  "it"  we  must  determine  its  functional 
role  in  the  sentence  —  its  "case"  in  the  sense  of  Fillmore  (see  subsection  2.3.4.2  — 
and  make  use  of  the  syntactic  and  semantic  restrictions  on  the  slot  occupied  by  the 
pronoun.  Then  the  previous  context  can  be  searched  for  a  concept  that  satisfies  those 
restrictions.  In  the  given  example  the  restrictions  are  first,  that  the  referent  of  "it" 
must  be  singular,  and  second,  that  it  must  be  a  physical  object  capable  of  impacting.  In 
the  given  subject  domain,  such  an  object  is  either  a  missile  or  a  satellite. 

The  following  paragraphs  suggest  a  method  for  handling  pronoun  references  of  the  kind 
Illustrated  above. 
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The  above  sentence  would  first  be  syntactically  analyzed  Into  a  propositional  structure, 
and  subsequently  undergo  case  assignment;  the  pronoun  'It’  Is  assigned  the  role  of 
Object.  As  mentioned  above,  'it'  carries  the  lexical  information  that  it  is  definite  and 
singular.  The  fact  that  it  was  assigned  the  Object  role  indicates  that  its  referent  must 
belong  to  either  the  missile  or  the  satellite  class.  This  information  is  included  in  the  res¬ 
trictions  specified  in  the  intensional  description  of  the  IMPACT  event  concept.  Once  the 
program  determines  the  restrictions  on  the  referent  for  "it",  it  proceeds  to  examine  all 
descriptions  recently  stored  in  memory  to  find  one  which  meets  the  semantic  feature 
tests  for  the  pronoun’s  case  slot.  If  such  a  description  is  found,  the  program  will  ensure 
that  it  agrees  in  number  with  "it"  before  it  is  accepted  as  a  possible  referent.  If  the 
test  is  positive,  the  program  creates  a  link  from  the  Object  slot  of  the  event  record 
representing  the  sentence  under  discussion  to  the  description  just  identified. 

When  pronouns  refer  back  over  longer  portions  of  a  text  the  resolution  process  is  more 
complicated,  since  the  larger  the  previous  context,  the  more  numerous  are  the  potential 
referents  of  the  pronoun  and  the  more  complex  the  inferential  processing  that  has  to  be 
performed. 

The  next  section  examines  definite  noun  phrases  which  are  used  anaphorically,  i.e.,  the 
noun  at  the  head  of  the  construction  refers  to  a  specific  object  or  concept. 

2.5.1 .2  Definite  Noun  Phrases.  The  major  difference  between  pronouns  and  definite 
noun  phrases  is  that  the  latter  carry  more  information.  Thus,  the  phrase  ’the  four  SS-1 1 
missiles'  contains  information  specifying  both  the  general  class  (missiles)  and  the  type 
(SS-1 1 )  of  the  objects  referred  to. 

The  resolution  of  such  noun  phrases  is  basically  a  problem  of  finding  a  matching  descrip¬ 
tion  in  memory.  The  methods  required  to  decide  whether  a  given  entity  fits  a  g’ven 
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description  are  often  very  complex  and  include  all  kinds  of  inferential  processes. 

In  the  simplest  cases  are  those  which  involve  matching  a  definite  description  with  an 
object  that  has  been  described  in  the  same  way  previously  in  the  text.  Other  cases 
may  involve  complex  routines  using  the  semantics  of  descriptors  as  well  as  their  syntac¬ 
tic  form,  the  Immediate  linguistic  context,  the  world  knowledge  stored  In  the  system, 
external  data  files  (special  dictionaries,  glossaries,  etc.),  and  sometimes  even  the 
extensional  event  data  base  in  order  to  decide  whether  two  descriptions  match. 

Sometimes  a  definite  noun  phrase  may  refer  to  an  entire  chain  of  events.  In  such  cases 
it  is  necessary  to  appeal  to  structures  made  up  of  units  defined  at  a  higher  conceptual 
level  than  that  customary  in  traditional  linguistic  analyses. 

For  a  computer  program  to  do  this,  It  must  be  able  to  refer  not  to  the  individual  sen¬ 
tences  of  the  message,  but  to  the  chain  of  events  represented  by  these  sentences 
expressed  in  higher  level  conceptual  categories  appropriate  for  event  description,  thus 
making  use  of  the  "Message  Grammar". 

One  of  the  areas  into  which  this  research  might  profitably  be  extended  is  exploring  the 
possibility  of  resolving  anaphora  not  only  within  the  limited  context  of  a  single  message, 
but  also  within  the  larger  context  of  an  EVENT  REPORT. 

2.6  Guidelines  for  the  Establishment  of  a  Research  Corpus 

Because  of  the  nature  of  the  data  for  which  this  system  is  developed,  a  special 
approach  is  taken  to  the  establishment  of  a  research  corpus. 

2.6. 1  Data  Collection.  For  any  given  subject  domain,  the  first  task  relates  to  the  collec¬ 
tion,  "sanitization",  and  organization  of  a  representative  sample  of  Intelligence  messages 
to  serve  as  the  source  data  for  the  analysis  and  characterization  of  the  reporting 
language. 
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Data  should  be  gathered  In  cohesive  units  which  form  a  conceptual  whole  in  the  given 
domain.  Thus,  in  the  missile  and  satellite  domain,  messages  were  collected  in  sets  con¬ 
stituting  EVENT  REPORTS.  Only  if  so  organized  can  a  systematic  study  of  the  domain  be 
undertaken. 

A  sufficient  amount  of  data  needs  to  be  coflected  for  each  subject  domain.  The  notion 
"sufficient  data"  can  only  be  quantified  by  a  step  by  step  collection  of  data  in  each 
subject  domain.  It  is  a  well  known  fact  that  each  new  batch  of  data  collected  for  any 
given  domain  yields  less  and  less  new  Information.  At  a  certain  point  the  graph  of  addi¬ 
tional  new  items  vs.  total  information  levels  off.  It  is  at  that  point  that  further  data  col¬ 
lection  becomes  unproductive. 

2.6.2  Data  Sanitization.  The  source  data  gathered  for  purposes  of  development  is  usu¬ 
ally  classified.  It  can  therefore  not  be  used  in  its  original  form  for  in-house  development 
of  computer  programs  ic.  the  analysis  of  message  text.  Such  developmental  work  can 
be  accomplished  far  more  easily  with  unclassified  test  data  than  with  actual  messages, 
provided  the  test  data  has  the  same  grammatical  structure  as  the  original  material. 

To  achieve  this,  a  procedure  to  generate  unclassified  message  text  from  classified  data 
was  developed.  The  procedure  consists  of  two  steps. 

«  Step  1 .  Narrative  message  text  is  transcribed  according  to  a  set  of  transcription 
rules,  according  to  which  all  proper  names  of  objects,  their  attributes,  and 
time /location  parameters  are  replaced  by  placeholders.  For  example,  names  of 
political  entities  (e.g.,  USA,  UK)  are  replaced  by  the  string  "politname".  Specific 
^geographic  placenames  (e.g.,  Cape  Kennedy,  Florida)  are  replaced  by  the  string 
*placename".  Names  of  persons  (e.g.,  Pete  Conrad,  Valery  Bykovsky)  are 
replaced  by  the  string  "personriame,  while  times  and  dates  are  replaced  by 
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appropriate  placeholders.  A  complete  list  of  the  sanitization  rules  developed  for 
the  Missile  and  Satellite  domains  is  given  in  subsection  3.2.1.  The  resulting  text 
is  used  for  the  study  of  the  grammatical  structure  of  the  reporting  language  and 
for  system  development. 

•  Step  2.  For  purposes  of  demonstrating  the  system,  the  transcribed  text  is  input 
to  a  special  routine  which  places  all  events  in  the  future  and  randomly  replaces 
the  placeholders  by  either  totally  ficticious  names  or  by  names  referring  to 
objects  in  outer  space. 

This  method  preserves  the  syntactic  structure  of  the  original  language  for  linguistic 
analysis.  At  the  same  time,  it  provides  some  appearance  of  verisimilitude. 

Thus,  the  transcribed  sentence  (a)  below,  might  be  changed  into  sentence  (b): 

a.  MISCLASS  LAUNCHED  FROM  PLACENAME1  TO  PLACENAME2,  DAYNO  NMTH  4NMBR. 

b.  UM-67  ICBM  LAUNCHED  FROM  BETELGEUSE  TO  RIGEL  23  JULY  1987.* 

2.6.3  The  KWIC  Concordance.  Next,  the  sanitized  data  is  put  into  a  machine  readable 
form  and  processed  by  a  standard  KWIC  index  program,  which  yields  a  key-word-in- 
context  concordance,  orthographic  type  counts,  and  some  limited  statistics.  The  utility 
of  KWIC  concordances  and  associated  frequency  lists  both  as  an  aid  to  linguistic 
analysis  and  for  determining  priorities  in  the  parser  need  hardly  be  stressed.  An  excerpt 
from  a  KWIC  concordance  produced  for  the  Missile  and  Satellite  domains  is  shown  in 
Table  2-1. 

A  listing  of  each  word  form  in  its  context  permits  a  more  precise  characterization  of  the 
sublanguage  under  investigation  than  would  be  possible  otherwise.  By  its  very  form,  the 

*  Note:  M-67  is  the  name  of  a  galactic  cluster  In  the  Cancer  constellation.  UM-67  is 
a  fictitious  name.  Betelgeuse  and  Rigel  are  stars  in  the  Orion  constellation. 
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KWIC  index  facilitates  the  determination  of  the  word  classes  and  relations  characteristic 
of  a  particular  sublanguage,  and  thus  forms  the  basis  for  defining  both  the  scope  and  the 
vocabulary  specific  to  a  reporting  language  and  the  scope  of  the  sublanguage  grammar 
to  be  used  by  the  parser. 

The  three  procedural  steps  described  above  lay  the  foundations  for  the  conceptual 
analysis  of  the  source  data. 
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Table  2-1  Concordance  Program  Output 
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3.0  MISSILE  AND  SATELLITE  DOMAIN  DEFINITION 


This  section  begins  with  a  brief  introduction  of  the  missile  and  satellite  worlds,  and  con¬ 
tinues  with  a  detailed  description  of  the  two  domains  in  the  form  of  a  domain  definition. 

3.1  The  Missile  and  Satellite  Worlds 

The  missile  and  satellite  worlds  have  a  large  number  of  properties  in  common. 

Both  worlds  contain  political  entities  (e.g.,  U.S.,  USSR,  Uganda),  which  carry  out  inten¬ 
tional  actions  (e.g.,  they  launch  satellites,  or  missiles). 

Both  worlds  are  characterized  by  certain  human  actions  whose  instigators  normally 
remain  unidentified.  Thus,  objects  are  identified;  missions  are  assessed  or  confirmed; 
events  are  expected,  announced  or  confirmed;  actions  are  attempted. 

In  both  worlds  time  and  location  indications  are  of  crucial  importance. 

There  are,  however,  some  significant  differences,  due  to  the  differences  in  function 
between  missiles  and  satellites.  Satellites  but  not  missiles  are  launched  into  a  given 
orbit;  satellites  but  not  missiles  perform  orbital  maneuvers;  satellites  but  not  missiles 
have  well-defined  missions;  some  satellites  perform  manned  flights,  in  which  case  they 
may  rendezvous  and  dock  with  other  spacecraft;  finally,  satellites  but  not  missiles  are 
deorbited,  abandoned,  or  reactivated. 

Another  important  difference  concerns  the  time  span  associated  with  the  series  of 
actions  and  events  connected  with  a  launch-  Thus,  while  the  duration  of  the  chain  of 
events  associated  with  missile  launchings  is  measured  in  minutes,  that  of  events  associ¬ 
ated  with  satellite  launchings  is  usually  measured  in  days,  months,  or  even  years,  as  in 
the  case  of  Skylab. 

The  characteristic  objects  In  the  satellite  world  are  spacecraft.  As  mentioned  above, 


the  range  of  actions  a  spacecraft  normally  performs  is  wider  than  that  of  missiles. 
Spacecraft  can  be  launched,  placed  into  orbit,  deorbited,  and  recovered;  they  can  per¬ 
form  orbital  maneuvers,  rendezvous  and  dock  with  other  spacecraft,  they  can  deorbit 
themselves,  reenter  the  earth’s  atmosphere,  break  up,  burn  up,  crash  to  earth  and 
Impact.  They  can  be  manned  or  unmanned.  They  can  have  different  kinds  of  missions. 
Their  physical  properties  are  varied. 

Missiles  are  simpler  to  describe  and  the  number  of  atomic  events  they  can  participate 
in,  although  similar,  are  much  more  restricted. 

3.2  Domain  Definition 

3.2.1  Research  Corpus.  The  test  corpus  used  as  a  basis  for  the  developmental  work 
described  in  this  report  consists  of  280  messages  organized  into  EVENT  REPORTS. 

3.2. 1. 1  Source  Data  Sanitization.  Approximately  20%  of  the  messages  collected  were 
transcribed  according  to  the  sanitization  rules  discussed  in  subsection  2.5.2.  For  a  list¬ 
ing  of  the  sanitized  message  texts,  see  Appendix  A. 

The  list  of  placeholders  used  for  the  transcription  of  the  missile  and  satellite  test  corpus 
Is  given  below. 

Placeholder  List 

Note  that  all  propernames  used  in  this  list  are  taken  from  the  sources  listed  in  the  Fore¬ 
word  to  this  report. 

•  SPANAME  (spacecraft  name):  COSMOS-724,  COSMOS-651,  VOYAGER,  SOYUZ- 
21,  M0LNIYA-2A  PIONEER-1 1,  etc. 

•  SPATYPE  (spacecraft  type):  SOYUZ,  COSMOS,  PIONEER 

•  SPACLASS  (spacecraft  class):  ESV 

•  SPASTANAME  (space  station  name):  SALYUT-4 

•  MISNAME  (missile  name):  SS-N-10,  SS-14,  SS-N-6, 
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•  MISCLASS  (missile  class):  INTERMEDIATE  BALLISTIC  MISSILE,  ICBM,  SL5M, 
DRONE,  SAM 

•  LSTYPE  (launch  system  type):  SS-4,  SATURN-1  B 

•  PRESSNAME:  TASS 

•  PLACENAME:  TYURATAM,  PLESETSK,  THE  AUSTRALIAN  OUTBACK 

•  SEANAME:  INDIAN  OCEAN,  BLACK  SEA 

•  TRAC  (test  range  acronym): 

•  POLITNAME:  SOVIET  UNION,  USSR,  USA 

•  POUT  AD  J:  SOVIET 

•  NATNAME:  (Used  for  nationals  of  countries)  SOVIETS,  AUSTRALIANS 

•  PERSONNAME:  CHARLES  (PETE)  CONRAD,  YURIJ  ROMANENKO,  GEORGIJ  GRECHKO  * 

•  COORDINATES:  68-28N  46-29E,  49-33N  160-24E,  70N  3BE,  (Note  the  variety 
of  formats  used  for  the  expression  of  coordinates). 

•  ZULUTIME:  0940Z,  281600Z,  (Note  the  different  formats). 

•  1NMBR:  1,  2,  etc. 

•  2NMBR:  10,  11,  etc.  (and  similarly  for  3NMBR,  etc.  Note  that  1 NMBR  or  2NMBR 
might  designate  a  day  of  the  month,  and  that  2NMSR  or  4MMRR  might  re*c:  to  a 
year.) 

•  NMBRNAME:  SEVEN,  SIX,  etc. 

•  1NMBRTH:  1ST,  2ND,  etc.  (and  similarly  for  2NMBRTH,  etc.) 

•  NMBRNAMETH:  FIFTH,  SIXTH,  etc. 

•  RNMBR:  used  for  Roman  Numerals  like  I,  II,  V,  VII. 

•  NMTH:  used  for  name  of  month,  e.g.,  JANUARY,  NOVEMBER 

The  list  of  "placeholders"  remains  open-ended.  New  placeholders  are  incorporated  as 

required  by  the  material  under  study. 

3.2.1 .2  Examples  of  Transcribed  Text.  The  following  examples  show  sanitized  versions 

of  real  message  text. 

Example  1 

MISNAME  LAUNCHED  FROM  PLACENAME  2NMBR  MNTH  4NMBR. 

AT  ZULUTIME1,  A  MISNAME  WAS  LAUNCHED  FROM  PLACENAME1. 

IT  IMPACTED  NEAR  PLACENAME2  AT  ZULUTIME2. 


3-3 


Example  2 


PRESSNAME  ANNOUNCES  LAUNCH  OF  SPANAME. 

AT  ZULUTIME,  PRESSNAME-THE  POLITADJ  NEWS  AGENCY-ANNOUNCED  THE  LAUNCH  OF 
SPANAME,  THE  UNIDENTIFIED  MILITARY  SUPPORT  SPACLASS  WHICH 

WAS  LAUNCHED  FROM  THE  PLACENAME  MISSILE  AND  SPACE  CENTER  AT  ZULUTIME  TODAY. 
ORBITAL  PARAMETERS  AS  CONTAINED  IN  THE  ANNOUNCEMENT  ARE: 

APOGEE  NMBR  KILOMETERS 
PERIGEE  NMBR  KILOMETERS 
INCLINATION  NMBR  DEGREES 
PERIOD  NMBR  MINUTES 

The  transcription  procedure  is  very  time  consuming.  The  exercise  of  developing  a  saniti¬ 
zation  procedure,  however,  has  proved  useful.  It  has  provided  a  means  of  partitioning 
the  message  vocabulary  into  those  items  which  are  common  English  words  and  those  that 
are  specific  to  the  domain  (nomenclature,  specific  ways  of  referring  to  times  and  dates, 
geographic  locations,  etc.)  and  therefore  require  special  recognition  procedures  in  an 
operational  environment. 

The  set  of  messages  sanitized  according  to  the  above  procedure  under  this  contract  are 
contained  in  Appendix  A. 

3.2.2  The  Structure  of  EXTENT  REPORTS.  As  one  would  expect,  the  structure  of  the 
EVENT  REPORTS  in  the  two  domains  reflects  the  state  of  affairs  in  the  real  world.  EVENT 
REPORTS  describing  satellite-related  activities  consist  of  several  messages,  each 
describing  one  or  more  atomic  events. 

For  example,  in  the  satellite  domain,  a  first  message  always  describes  a  launch,  and  is 
invariably  followed  by  a  message  confirming  the  launch.  A  subsequent  message  reports 
the  announcement  of  the  launch  by  a  foreign  press  agency  and  usually  provides  details 
of  orbital  parameters.  The  foreign  press  announcement  is  optionally  followed  by  one  or 
more  messages  describing  related  events  such  as  orbital  maneuvers,  stabilization 
maneuvers,  a  rendezvous,  a  docking,  a  deorbit,  an  impact  and/or  a  recovery. 
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The  last  message  in  an  EVENT  REPORT  is  usually  a  summary  statement  describing  the 
global  event. 

EVENT  REPORTS  describing  missile  launching  and  related  events  are  much  more  compact. 
Often  the  launch,  the  reentry  and  the  impact  of  a  particular  missile  are  described  in  a 
single  message. 

This  structural  difference  has  implications  for  the  process  models  which  will  apply  to  the 
two  domains;  the  model  for  the  satellite  domains  will  be  much  more  complex  than  that 
for  the  missile  domain. 

3.2.?. 1  Message  Types.  In  order  to  characterize  EVENT  REPORTS  at  the  global  level, 
their  internal  composition  in  terms  of  message  types  was  studied. 

Individual  messages  were  classified  according  to  the  type  of  information  they  contain. 
As  pointed  out  in  subsection  2.3,  it  is  important  to  distinguish  between  those  messages 
that  refer  to  an  event  for  the  first  time,  and  those  that  confirm  events,  elaborate  on 
their  properties,  request  changes,  report  on  related  events,  offer  comments,  or  provide 
global  summaries  of  EVENTS. 

Several  distinct  message  types  were  identified: 

a.  Messages  describing  a  new  event. 

b.  Messages  confirming  an  event. 

c.  Messages  elaborating  on  the  parameters  of  a  previously  reported  event. 

d.  A  report  of  a  report,  e.g.,  a  message  reporting  on  a  foreign  news  agency  press 
release  concerning  an  event. 

e.  Data  Summary  Messages. 

f.  Closure  Messages 

g.  Follow  up  Messages. 

h.  Requests  for  changes  to  previously  reported  facts. 

The  message  types  studied  in  detail  under  this  contract  include  those  listed  under 
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a-d.  Types  e-h  are  characterized  by  an  abundance  of  evaluative  statements, 
hypothetical  statements,  and  statements  expressing  opinions  or  certain  inferences. 
The  current  study  was  limited  to  statements  describing  events  and  therefore 
excludes  message  types  e-h. 

3. 2. 2. 2  Event  Types  and  other  Concepts.  The  following  major  Event  types  were  identi¬ 
fied: 

a.  LAUNCH 

b.  DEORBIT 

c.  DOCKING 

d.  REENTRY  (AND  DECAY) 

e.  BREAKUP 

f.  IMPACT 

A  number  of  other  concepts  with  a  complex  internal  structure  were  identified: 

a.  Objects:  Missiles  and  Satellites 

b.  The  date  time  group  (DTG) 

c.  Orbital  Parameters:  Apogee,  Perigee,  Period  and  Inclination. 

The  relations  which  hold  between  events  in  the  missile  and  satellite  domains,  are  mainly 

temporal  relations  of  succession  in  time  and  relations  of  presupposition  and  entailment. 
A  spacecraft  which  is  in  orbit  now  must  have  necessarily  been  launched  some  time  pre¬ 
viously.  A  spacecraft  which  is  reentering  the  earth’s  atmosphere  will  break  up,  burn  up, 
or  crash  to  earth. 

A  complete  listing  of  the  templates  developed  for  the  event  types  and  other  concepts 
listed  above,  including  their  ancllliary  procedures  as  encoded  in  Prolog  is  given  in 
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The  DEORBIT  template,  as  encoded  In  Prolog,  Is  explicated  In  subsection  4.3.3. 

3.2. 2.3  Descriptor  System.  The  following  descriptors  were  Identified  for  the  Missile  and 
Satellite  Domains: 
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Table  3-1.  Missile  and  Satellite  Descriptor  System 


A.  Higher-Level  descriptors 
Infosource 


Status 

B.  Event-related  descriptors 
Agent 

Object 


Location 


Destination 


Mission 

Revolution 

C.  Orbit-related  descriptors 
Apogee 
Perigee 
Period 
Inclination 

C.  Launch-related  descriptors 
Launchsite 


Launchsystem 

D.  Date/Time  Group  Descriptors 
Time 

Date 

Duration 

E,  Object-related  descriptors 

Equipment  Class 

Setspecif ication 
Further  description 


The  source  of  a  report  of  an  event  ' 
e.  g. ,  a  foreign  news  agency  I 

Status  of  event  (expected  to  happen; J 
failed  to  happen;  confirmed)  1 

Animate  instigator  of  an  action.  J 

The  entity  that  moves  or  changes  or  . 
whose  position  or  existence  is  being 
described.  J 

The  location  of  the  object  at  some  ^ 
point  in  time.  | 

Projected  or  actual  destination  | 

of  the  object  at  the  end  of  the  j 

mission. 

Mission  of  satellite  I 

Usually  revolution  on  which  J 


Site  from  which  a  missile  or 
satellite  was  launched 

System  used  for  firing  missile 
or  satellite. 

Time  of  observed  event 
Date  of  observed  event 
Duration  of  an  event 

Spacecraft  or  missile  class 

Number  of  objects 

Usually  a  relative  clause 
or  appositive  clause 
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3.2.3  Language  Definition.  The  reporting  language  used  in  the  two  domains  has  special 
grammatical  and  lexical  properties  which  justify  calling  it  a  sublanguage. 


3.2.3. 1  Relationship  between  General  English  and  the  Reporting  Language.  The  report¬ 
ing  language  consists  of  declarative  sentences.  There  are  no  questions  or  commands. 
Furthermore,  the  reporting  language  is  characterized  by  grammatical  constructions  which 
deviate  from  those  of  "normal"  English,  (e.g.,  dropped  articles,  dropped  prepositions). 
Although  it  might  be  possible  to  first  recover  the  deleted  material  and  then  subject  the 
expanded  text  to  a  general  parsing  grammar,  it  turned  out  more  convenient  to  write  a 
specialized  grammar  stating  the  allowable  combinations  of  word  classes  directly.  Our 
number  of  rules  still  is  considerable  smaller  than  those  that  would  be  required  for  general 
English. 

As  an  example  of  idiosyncratic  vocabulary  usage,  consider  the  verb  "deploy".  In  ordi¬ 
nary  discursive  prose,  this  verb  is  normally  used  with  an  animate  Agent:  somebody 
deploys  something.  In  our  corpus,  inanimate  objects  like  aircraft  more  often  than  not 
deploy  themselves.  Constructions  with  the  Agent  expressed  in  surface  structure  occur 
less  frequently. 

3 .2.3.2  The  Grammar.  In  the  MATRES  II  System,  the  linguistic  structure  is  defined  by 
means  of  an  augmented  transition  network  grammar  in  terms  of  familiar  linguistic 
categories  such  as  sentence,  nounphrase,  verbgroup,  prepositional  phrase  and  adverb. 

In  order  to  expedite  processing,  a  number  of  language  specific  categories,  not  usually 
found  In  traditional  grammars,  were  added.  Thus,  the  familiar  definition  of  prepositional 
phrase  in  (a)  was  augmented  to  encompass  dates  (b): 

(a)  pp  -»  prep  +  nounphrase 

(b)  pp  -»  prep  +  date 
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where  ’date’  is  a  non-terminal  of  the  grammar  with  its  own  internal  structure. 

In  this  section  we  give  an  informal  description  of  the  major  grammatical  phenomena  which 
are  covered  by  the  grammar,  and  of  the  analyses  which  are  given  them  by  the  ATN 
parser. 

3.2.3.2.1  The  Declarative  Sentence.  The  only  class  of  sentences  handled  by  the  current 
version  of  the  grammar  are  declarative  sentences.  As  mentioned  above  there  are  no 
other  sentence  types  in  the  corpus. 

A  declarative  sentence  may  be  a  simple  sentence  as  in  (1),  a  complex  sentence  with 
embedded  nominal  clauses  as  in  (2)  and  (3),  or  sentences  with  adverbial  subordinate 
clauses  as  in  (4)  and  (5).  The  current  version  of  the  grammar  does  not  handle  coordina¬ 
tion. 

1.  SKYLAB  DEORBITED  OVER  CANADA. 

2.  TASS  ANNOUNCED  THAT  SKYLAB  DEORBITED  OVER  CANADA. 

3.  SKYLAB  FAILED  TO  IMPACT  IN  CANADA. 

4.  THE  SATELLITE  DEORBITED  AFTER  A  13  DAY  MISSION. 

6.  THE  SATELLITE  DEORBITED  FOLLOWING  A  13  DAY  MISSION. 

The  MATRES  II  grammar  analyzes  a  declarative  sentence  as  a  list  having  as  its  first  ele¬ 
ment  a  simple  sentence,  which  may  be  followed  optionally  by  a  sentence  conjunction, 
and  either  another  simple  sentence  or  a  noun  phrase. 

3.2.3. 2.2  The  Simple  Sentence.  A  simple  sentence  may  have  six  components,  of  which 
only  the  main  predicate  is  mandatory.  The  components  are:  voice  (active  or  passive), 
subject  (a  noun  phrase),  a  verb  group,  optionally  followed  by  a  direct  object,  a  comple¬ 
ment,  and  one  or  more  post-verb  modifiers. 
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The  grammar  analyzes  a  simple  sentence  as  a  six-branched  node  structure.  The  first 
branch  points  to  the  voice  node,  the  second  to  the  subject  node,  the  third  branch  to  the 
verb  group,  the  fourth  to  the  object,  the  fifth  to  a  complement,  and  the  sixth  to  a  list  of 
adverbial  modifiers. 

3.2.3.2.3  The  Noun  Phrase.  In  the  simplest  case,  a  noun  phrase  may  consist  of  a  pro¬ 
noun  or  a  proper  noun  optionally  followed  by  an  appositive  construction.  On  the  other 
extreme,  a  noun  phrase  may  consist  of  a  determiner  followed  by  a  list  of  pre-head 
modifiers,  a  head  noun,  and  a  list  of  post-head  modifiers. 

A  determiner  may  consist  simply  of  an  article  (e.  g.,  ‘THE’),  a  quantifier  (eg.  ‘ALL’),  or  a 
number  phrase  (eg.  ‘AS  MANY  AS  SIX’),  or  it  may  be  a  complex  structure  involving  two  or 
three  of  these  constituents,  as  shown  in  the  examples  below: 

ALL  THE  MISSILES 

ALL  SIX  MISSILES 

THE  SIX  MISSILES 

ALL  OF  THE  SIX  MISSILES 

Pre-head  modifiers  may  include  adjectives,  nouns,  past  participles,  and  present  partici¬ 
ples.  In  both  domains  analyzed,  head  nouns  are  typically  preceded  by  several  modifiers 
referring  to  various  attributes.  Example  (8)  is  taken  from  the  aircraft  domain,  while 
example  (9),  is  from  the  satellite  domain. 

(8)  RETURNING  UGANDAN  UBBC  SR- 71  AIRCRAFT 

(9)  A  FIRST  GENERATION  HIGH  RESOLUTION  PHOTOGRAPHIC  SATELLITE 

Possible  post-head  modifiers  are  relative  clauses,  reduced  relative  clauses,  apposltives, 


and  prepositional  phrases.  An  example  of  each  is  given  in  (10)  through  (13),  respec¬ 
tively. 

(10)  SKYLAB,  WHICH  WAS  LAUNCHED  FROM  THE  KENNEDY 
SPACE  CENTER  ON  14  MAY  1973,.... 

(11)  THE  SATELLITE,  LAUNCHED  FROM  THE  KENNEDY 

SPACE  CENTER  ON  14  MAY  1973, . 

(12)  THE  SKYLAB  ORBITAL  WORKSHOP,  A  CONVERTED  S-4B 

THIRD  STAGE  FROM  A  SATURN-5  LAUNCH  VEHICLE,  DEORBITED . 

(13)  THE  AIRCRAFT  FROM  ENTEBBE 

A  noun  phrase  is  analyzed  as  a  four-branched  node.  The  first  branch  points  to  a  deter¬ 
miner  (possibly  null,  as  in  (10)),  the  second  to  a  list  of  pre-head  modifiers,  the  third  to 
the  head  noun,  and  the  fourth  to  a  list  of  post-head  modifiers. 

The  current  version  of  the  grammar  only  allows  simple  noun  phrases  (i.e.,  those  without 
post-head  modifiers)  to  occur  as  direct  objects  or  prepositional  objects. 

3. 2.3. 2.4  Nominalizations.  The  following  constructions,  referred  to  as  nominalizations, 
are  very  frequent  in  our  research  corpus: 

a.  THE  DEORBIT  OF  SKYLAB 

b.  THE  IMPACT  OF  SKYLAB 

c.  THE  REENTRY  OF  SKYLAB 

These  nominalizations  are  parsed  as  noun  phrases  and  are  later  converted  into  proposi¬ 
tional  structures  in  the  ERL  "machine"  before  semantic  interpretation.  Nouns  like  DEOR¬ 
BIT,  IMPACT  and  REENTRY,  which  denote  events,  become  the  main  predicates  of  their 
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respective  propositions,  and  the  objects  of  the  preposition  ‘of’  become  the  logical  sub¬ 
jects.  Such  nouns  are  marked  by  the  feature  EVENTIVE  in  the  lexicon. 

3. 2.3. 2.5  Relative  Clauses.  Among  the  post  nominal  modifiers,  the  relative  clause  is  one 
of  the  most  frequent  constructions.  A  relative  clause  may  start  with  a  relative  pronoun 
(e.g.,  which,  that),  in  which  case  we  refer  Jo  it  as  a  "wh"  relative  (example  a),  or  it  may 
start  with  a  past  participle  (b)  or  a  present  participle  (c)  optionally  preceded  by  an 
adverb,  in  which  case  we  refer  to  it  as  a  reduced  relative. 

(a)  THE  UNIDENTIFIED  MISSILE,  WHICH  WAS  SOFTLANDED 
NEAR  THE  SEANAME  TODAY,  WAS  FIRED  FROM  PLACENAME. 

(b)  SATNAME,  THE  FIRST  GENERATION  HIGH  RESOLUTION 
PHOTOGRAPHIC  SATELLITE  LAUNCHED  FROM  PLACENAME 
ON  2NMBR  MONTHMAME,  WAS  DEORBITED  DURING 

ITS  3NMBRTH  REVOLUTION. 

(c)  THE  SPACECRAFT,  CARRYING  TWO  ASTRONAUTS, 

WAS  SUCCESSFULLY  INJECTED  INTO  AN  ORBIT 
INCLINED  2NMBR  TO  THE  EQUATOR. 

Relative  clauses  usually  describe  an  event  related  to  the  main  event  of  the  sentence. 
As  such  they  provide  links  to  previous  sentences  or  even  messages. 

In  the  current  version  of  the  system,  relative  clauses  are  parsed  as  sentences  and 
stored  in  the  postmodifier  list  of  the  head  noun.  The  interpretive  routines  recognize 
relative  clauses  and  store  them  as  a  unit  in  the  "Relative"  slot  of  event  templates.  A 
more  sophisticated  system,  however,  would  break  down  relative  clauses  and  build  event 
records  for  them,  which  would  be  suitably  connected  to  the  main  template.  This  was  not 
done  under  this  contract  simply  for  lack  of  time. 

The  current  version  of  the  grammar  also  allows  for  relative  clauses  with  a  sentential 
antecedent  as  in  the  sentence  below: 


THE  AIRCRAFT  WAS  APPARENTLY  POSITIONED  AT 
01 25N3470E,  WHICH  WOULD  PLACE  IT  APPROXIMATELY 
6  KMS  INSIDE  UGANDAN  AIRSPACE. 

Such  relatives  are  stored  in  the  post  modifier  list  of  the  main  verb.  The  following  exam¬ 
ple  illustrates  how  relative  clauses  are  stored  in  templates: 

(a)  THE  UNIDENTIFIED  MISSILE  ,  WHICH  WAS  SOFTLANDED  NEAR 
THE  SEANAME  TODAY,  WAS  FIRED  FROM  PLACENAME. 

Event:  LAUNCH 
Action=  FIRED 
Object:  MISSILE 

...Equipment3  UNIDENTIFIED  MISSILE 
...Number3 

...Relative3  SOFTLANDED  NEAR  THE  SEANAME  TODAY 
Launchsite3  FROM  PLACENAME 

3.2.3.2.G  Noun  Phrase  Apposition.  Noun  phrase  apposition  is  very  common  in  the  sub¬ 
languages  under  study. 

For  two  or  more  noun  phrases  to  be  appositives,  i.e.,  in  apposition,  they  musl  normally  be 
identical  in  reference  or  else  the  reference  of  one  must  be  included  in  the  reference  of 
the  other.  For  example,  in  (a),  THE  SOVIET  NEWS  AGENCY  and  TASS  refer  to  the  same 
organization. 

(a)  THE  SOVIET  NEWS  AGENCY,  TASS,  ANNOUNCED  THAT . 

The  semantic  relationship  between  the  two  noun  phrases  is  one  of  "appellation",  a  sub¬ 
class  of  the  more  general  "equivalence"  relationship  (c.f.,  Quirk,  1972).  With  "appella¬ 
tion",  both  noun  phrases  are  definite  and  the  second  is  typically  a  proper  noun. 

The  converse  of  "appellation"  is  "designation"  also  a  subclass  of  the  "equivalence" 
relationship.  With  Jesignation  —  as  in  the  case  of  appellation  —  both  appositives  are 
commonly  definite  <un  phrases,  but  here  the  second  appositive  is  less  specific  than  the 


3-14 


first,  as  illustrated  in  (b) 


(b)  TASS,  THE  SOVIET  NEWS  AGENCY,  ANNOUNCED  THAT . 

A  third  subclass  of  the  equivalence  relationship  is  that  of  "identification".  With  identifi¬ 
cation,  the  first  appositive  is  typically  an  indefinite  noun  phrase  and  the  second  apposi- 
tive  is  more  specific  as  in  (c)  below: 

(c)  AN  UNIDENTIFIED  MISSILE,  PERHAPS  A  DRONE, . 

Here,  there  is  no  longer  unique  equivalence  as  there  was  with  (a)  and  (b);  the  second 
appositive  identifies  --  often  only  tentatively  —  what  is  given  in  the  first  appositive. 

A  second  type  of  major  semantic  relationship  in  strict  non-restrictive  noun-phrase  appo¬ 
sition  is  that  of  "attribution".  "Attribution"  involves  predication  rather  than  equivalence. 
The  second  appositive  is  commonly  an  indefinite  noun  phrase  (although  it  can  also  be 
definite),  and  can  be  replaced  by  a  relative  clause: 

(d)  SPANAME,  A  LOW  RESOLUTION  PHOTOGRAPHIC  SATELLITE, 

LAUNCHED  FROM  TYURATAM  AT  OOOZ  ON  30  DEC  1955, . 


Figure  3-2  summarizes  the  semantic  relationships  in  the  types  of  appositives  identified 


so  far  in  the  three  subject  domains  analyzed  thus  far. 
+ - 


EQUIVALENCE 

ATTRIBUTION 


APPELLATION 

IDENTIFICATION 

DESIGNATION 


+ - + 

Figure  3-2.  Semantic  Relationships  in  non-restrictive 
noun-phrase  apposition. 

Because  all  the  subclasses  of  noun  phrase  apposition  found  in  the  aircraft,  missile,  and 
satellite  materials  to  date  are  such  that  the  second  appositive  can  be  replaced  by  a 
relative  clause,  it  was  decided  to  analyze  them  as  relative  clauses  in  the  ATN  grammar, 
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and  to  shift  the  burden  of  distinguishing  appositives  from  relative  clauses  to  the  inter¬ 
pretive  component  (the  ERL  "machine”)  --  if  such  distinctions  were  useful  in  an  an 
operational  environment. 


The  following  example,  taken  from  the  the  Missile  domain,  shows  how  a  sentence  con¬ 
taining  an  appositive  is  analyzed  by  the  current  version  of  the  MATRES  II  system. 

*»  AN  UNIDENTIFIED  MISSILE,  PROBABLY  A  MISNAME, 

*WAS  FIRED  FROM  PLACENAME  AT  APPROXIMATELY  ZULUTIME  TODAY. 

Event:  LAUNCH 
Action=  FIRED 
Object:  MISSILE 

...Equipment=  UNIDENTIFIED  MISSILE 
...Number= 

...Relatives  PROBABLY  A  MISNAME 

Laur.chsite=  FROM  PLACENAME 

Time=  AT  APPROXIMATELY  ZULUTIME  TODAY 

3. 2.3. 2.7  The  Verb  Group.  The  verb  group  may  consist  of  an  auxiliary  followed  by  a 
verb,  as  in  (14),  or  an  auxiliary  followed  by  a  copula  followed  by  an  adjective,  as  in 
(15). 

(14)  HAVE  BEEN  CONDUCTING 


(15)  HAVE  BEEN  ACTIVE 

In  (14)  the  auxiliary  is  'HAVE  BEEN’,  while  in  (15)  the  auxiliary  is  ‘HAVE’,  and  'BEEN’  is  the 
copula. 

Some  verbs  (eg.  ‘CONDUCT’,  ‘PENETRATE’)  must  be  followed  by  a  direct  object  consti¬ 
tuent,  which  is  another  noun  phrase.  Other  verbs  (eg.  ‘ARRIVE’)  never  have  a  direct 
object,  while  for  others  (eg.  ‘OPERATE’)  the  object  is  optional.  Such  verbs  ere  marked 
TRANS,  INTRANS  or  left  unmarked  in  the  lexicon. 

3. 2.3. 2.8  Aduerbials.  Adverbial  modifiers  include  prepositional  phrases  and  adverbs,  and 
may  occur  before  the  subject,  as  in  (16),  after  the  verb  (and  the  object,  if  there  is  one) 
as  in  (1  7),  or  embedded  within  the  verb  group,  as  is  the  case  with  ‘CURRENTLY’  in  (16). 
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(16)  AT  APPROXIMATELY  S  AM  ON  26  MAY  1973, 

AN  APOLLO  SPACECRAFT  CARRYING  THREE  ASTRONAUTS 
WAS  LAUNCHED  FROM  THE  KENNEDY  SPACE  CENTER. 

(1  7)  THE  SATELLITE  WAS  LAUNCHED  AT  APPROXIMATELY 
1330  HOURS  ON  14  MAY  1973. 

(18)  THE  SATELLITE  IS  CURRENTLY  ORBITING  OVER  CANADA. 

3. 2.3. 2.9  The  Concepts  of  Time  and  Space.  Two  of  the  key  concepts  in  intelligence 
reporting  normally  expressed  as  adverbials  are  the  concepts  of  "time''  and  "space". 

Knowledge  of  temporal  patterns  of  given  classes  of  events  and  time  order  relations 
between  event  classes,  coupled  with  location  data,  can  often  assist  the  analyst  in 
creating  an  overview  of  the  current  situation  and  in  predicting  possible  outcomes. 
Knowledge  of  all  events  that  occurred  at  about  the  same  time  or  within  a  given  time- 
span,  in  a  given  geographic  area,  can  greatly  enhance  the  analyst’s  capability  to  make 
accurate  assessments  of  world  situations. 

Time  references  are  particularly  important,  since  in  many  cases  they  provide  the  only 
explicit  link  between  the  various  messages  constituting  an  EVENT  REPORT  (i.e.,  all  mes¬ 
sages  referring  to  the  same  global  event). 

For  example,  consider  an  event  referring  to  the  launch  of  a  new  satellite.  A  first  mes¬ 
sage  may  report  the  observed  launching  of  an  "unidentified"  satellite.  Although  there  is 
no  name  available  for  the  satellite,  such  a  message  usually  gives  precise  information  as 
to  the  time,  date  and  site  of  the  launching.  A  subsequent  message  may  state  that  the 
satellite  launched  at  such  and  such  a  time  from  such  and  such  a  launchsite  is  now  con¬ 
firmed  in  orbit,  but  still  lack  information  on  the  class  of  the  satellite  launched.  The  mes- 


sage  identifying  the  satellite  by  name  is  sometimes  not  received  until  the  foreign  news 
agency  announces  the  launch,  or  when  the  satellite  is  otherwise  identified.  Notice  that 
up  to  that  point  the  only  available  link  between  the  received  messages  is  the  launch 
time  and  date  and  possibly  the  site. 

The  linguistic  realization  of  time  and  location  adverbials  in  our  particular  message  domain 
often  differs  rather  sharply  from  that  in  "normal"  English. 

Time  phrases  in  the  sublanguages  under  consideration  tend  to  occur  either  at  the  begin¬ 
ning  of  sentences  or  at  the  end.  In  addition,  they  have  special  characteristics  which 
render  them  easy  to  isolate  from  the  rest  of  the  sentence.  First,  they  usually  begin  with 
certain  prepositions  (e.g.,  at,  between,  during,  in,  on,  since,  from  ...  to,  until).  Second, 
they  necessarily  contain  one  or  more  of  a  set  of  words  designating  the  months  of  the 
year  (including  abbreviations);  numbers,  either  spelled,  in  numeric  form,  or  a  form  peculiar 
to  intelligence  reporting  (zulutime);  and  third,  expressions  like  "today",  "this  date",  this 
year".  The  prepositions  and  the  time  expressions  in  the  third  group  form  closed  sets 
and  can  be  easily  identified  on  the  basis  of  a  fairly  restricted  sample  for  any  given  class 
of  messages. 

The  following  are  some  examples  of  time  and  date  expressions  taken  *rom  the  intelli¬ 
gence  messages  reporting  on  aircraft  activities  and  missile  and  satellite  launchings: 

•  AT  0940Z;  AT  0940;  AT  094000Z;  AT  APPROXIMATELY  0940Z;  AT  ABOUT  0940Z 

•  AT  THE  SAME  TIME 

•  BETWEEN  0115  AND  0332Z;  BETWEEN  01 1  5-0332Z;  BETWEEN  01 1  5:0332Z 

•  BY  201600Z 

•  CURRENTLY 

•  DURING  THE  EARLY  0200Z  HOUR 

•  DURING  THE  MORNING  HOURS 

•  EARLIER;  EARLIER  TODAY 
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•  FROM  1 400-1 625Z 

•  SINCE  0119;  SINCE  ABOUT  01 19Z;  SINCE  APPROXIMATELY  0119 
Corresponding  date  expressions  are: 

•  DURING  THE  YEAR 

•  IN  1977;  IN  OCTOBER  1977;  OCTOBER  1977 

•  ON  12  DECEMBER;  ON  12  DECEMBER  1977;  ON  12/13  DECEMBER 

•  ON  THE  3RD;  ON  3RD  APR;  ON  THE  3RD  APR 

•  12  DECEMBER;  12  DEC  74;  12  DECEMBER  1974;  DECEMBER  1977 

•  THIS  DATE;  ON  THIS  DATE;  ON  THAT  DATE 

•  THIS  YEAR 

•  TODAY;  EARLIER  TODAY 

Date  phrases  are  analyzed  as  three  branched  nodes.  The  first  branch  points  to  the  day, 
the  second  to  the  month,  and  the  third  to  the  year  (the  third  is  often  null). 

3.2.3.2.10  Location  References.  Natural  language  expressions  describing  locations  are 
far  more  varied  than  expression  referring  to  time.  While  many  such  expressions  may 
only  be  identifiable  by  means  of  linguistic  analysis,  there  are  nevertheless  some  classes 
that  could  be  handled  by  a  finite  state  character  processing  algorithm.  Obvious  candi¬ 
dates  are  references  to  location  by  means  of  coordinates  (e.g.,  5026N7138E,  50-26N 
071-38E,  and  variations  thereof). 

3.2.3. 2  1 1  Complementation.  The  current  version  of  the  grammar  handles  two  types  of 
complements:  "that"-complements  and  "to"-complements,  the  only  types  found  in  the 
reporting  languages  under  investigation. 

Examples  of  'that’-complements  from  the  missile  and  satellite  corpus  are: 

a.  ...SUGGESTS  THAT  A  MALFUNCTION  MAY  HAVE  OCCURRED  AS  EARLY 
AS  2NMBR  MNTH. 

b.  ...SUGGESTS  THAT  NMBRNAME  WERE  CARRIED. 

C.  ...ANNOUNCED  THAT  SPANAME  WAS  LAUNCHED  FROM  THE  POLITNAME. 

d.  ...SUGGESTS  THAT  THE  MISSLE  WAS  A  MISNAME. 

e.  ...INDICATED  THAT  THE  SECOND  STAGE  ENGINE  FAILED  TO  IGNITE 

AS  PROGRAMMED  FOLLOWING  FIRST  STAGE  SHUTDOWN. 
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f.  ...INDICATIONS  ARE  THAT  THE  SECOND  STAGE  OF  THE  LAUNCH 

VEHICLE  FAILED  TO  IGNITE. 

g.  ...INDICATES  THAT  THE  SPACECRAFT  HAS  BEEN  ABANDONED. 

h.  ...INDICATED  THAT  THE  SPACECRAFT,  IF  SUCCESSFUL,  WOULD  HAVE 

BEEN  INSERTED  iNTO  AN  ORBIT  SIMILAR  TO  PREVIOUS  NAVSATS. 

I.  ...INDICATES  THAT  THIS  WAS  A  NOMINAL  MISNAME  MOD  1NMBR,  ... 

As  the  above  examples  show,  ‘that’-complements  are  complete  declarative  sentences 

preceded  by  the  word  ‘that’.  In  our  corpus,  they  are  found  in  object  position  of  verbs, 

as  well  as  after  certain  nouns.  The  parser  analyzes  ‘that’-complements  as  sentences 

and  stores  them  in  the  ‘Compl’  register. 

‘To’-complements  consist  of  the  ‘to’  marker  followed  by  the  first  verb  of  the  predicate, 
which  must  be  untensed.  Examples  of  ‘to’-complements  from  the  missile  and  satellite 
corpus  are: 

a.  ...FAILED  '.O  ACHIEVE  EARTH  ORBIT  AND  REENTERED. 

b.  ...WAS  TO  BE  INCUNEO  2NMBR  DEGREES  TO  THE  EQUATOR. 

c.  ...WAS  THE  NMBRNAMETH  MISNAME  TO  BE  LAUNCHED  FROM  PLACENAME 

THIS  YEAR. 

d.  ...APPEAR  TO  BE  PRESENTLY  CONDUCTING  A  SYSTEMATIC  CHECKOUT 

OF  THE  ON-BOARD  SYSTEMS. 

e.  ...WAS  ALLOWED  TO  CONTINUE  ITS  ORIGINAL  WESTWARD  DRIFT  IN 

SILENCE. 

f.  ...ATTEMPTED  TO  GEOPOSITION  THE  SATELLITE  ON  1NMBR  MNTH. 

g.  ...FAILED  TO  IGNITE. 

h.  ...FAILED  TO  IGNITE  AS  PROGRAMMED  FOLLOWING  FIRST  STAGE 

SHUTDOWN. 

i.  ...IS  EXPECTED  TO  RENDEZVOUS  AND  DOCK  WITH  THE  ORBITING  SPACE 

STATION  SPASTANAME. 

J.  ...WAS  APPARENTLY  INTENDED  TO  REPLACE  SPANAME. 

k.  ...SPANAME  1  WAS  LAUNCHED  TO  REPLACE  SPANAME2. 

l.  ...WILL  PROBABLY  ATTEMPT  TO  SEPARATE  THE  NUCLEAR  POWER  SUPPLY 

FROM  THE  MAIN  PAYLOAD. 

m.  ...ARE  CONTINUING  TO  SHIFT  WESTWARD  AT  ABOUT  NMBR  DEGREES  PER  DAY. 

n.  ...AFTER  FAILING  TO  STABILIZE  IT. 
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3.2.3.2.12  Passii/e  Sentences.  A  form  of  "be"  followed  by  a  past  participle  form  of  a 
verb  indicates  the  passive  construction.  Passive  constructions  are  “inverted"  so  that 
passive  and  active  forms  of  the  same  sentence  result  in  the  same  "deep"  structure.  In 
the  MATRES  II  grammar,  a  sentence  such  as  (24)  is  restructured  as  in  (25),  where  the 
surface  subject  becomes  the  object,  and  the  surface  object  becomes  the  subject. 

(24)  THE  SATELLITE  WAS  LAUNCHED  BY  NASA  ON  14  MAY  1973. 

(25)  NASA  LAUNCHED  THE  SATELLITE  ON  14  MAY  1973. 

3.2.3.2.13  Ambiguity.  Because  of  the  limited  semantic  domain,  the  possibilities  for 
word-sense  ambiguity  are  greatly  reduced. 

3.2.3.2.14  The  Paraphrase  Problem.  Since  we  are  dealing  with  a  restricted  subject 
domain,  we  are  not  confronted  with  the  whole  spectrum  of  difficulties  one  would  expect 
In  a  system  for  general  English.  Nevertheless,  we  do  have  to. deal  with  a  certain  amount 
of  paraphrase. 

Sentences  with  eventive  nouns  as  logical  subject  and  with  verbs  such  as  TAKE  PLACE, 
OCCUR,  or  forms  of  the  verb  BE  as  main  verb,  are  considered  paraphrases  of  sentences 
with  main  verbs  which  correspond  to  the  eventive  noun.  Two  examples  are  shown  in  26 
and  27  below.  Note  the  similarity  of  the  corresponding  event  records. 

Example  1 :  Impacted  vs.  Impact  (occurred/took  place/was... 

26.  (a)  SKYLAB  IMPACTED  IN  WESTERN  AUSTRALIA  JUST  SOUTHEAST 
OF  KALGOORLIE  ON  12  JULY  1979. 

(b)  IMPACT  OF  SKYLAB  OCCURRED/  TOOK  PLACE/  WAS  IN 
WESTERN  AUSTRALIA  JUST  SOUTHEAST  OF  KALGOORLIE 
ON  1 2  JULY  1 979. 
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Table  3-3  (a)  Event  Record  for  Sentence  26(a) 


+ - - + 

I  ! 

I  Event:  IMPACT  I 

I  Action=  IMPACTED  I 

I  Object:  SATELLITE  I 

I  ...Equipments  SKYLAB  I 

I _ Relative=  I 

I  Location=  IN  WESTERN  AUSTRALIA  JUST  SOUTHEAST  OF  I 


I  KALGOORLIE  I 

I  Date=  ON  12  JUL  1979  I 

I  I 

+ - f 

Table  3-3  (b).  Event  Record  for  Sentence  26(b). 

+ - i 


I  Event:  IMPACT  I 

I  Action:  IMPACT  I 

I  Object:  SATELLITE  I 

I  , . .  Equipments  SKYLAB  I 

I  ...Relatives  I 

I  Locations  IN  WESTERN  AUSTRALIA  JUST  SOUTHEAST  I 

I  OF  KALGOORLIE  I 

I  Date=  ON  12  JUL  1979  I 

I  I 

+ - + 


Example  2:  Reentered  vs.  Reentry 

27.  (a)  SKYLAB  REENTERED  THE  EARTH’S  ATMOSPHERE 
OVER  CANADA  ON  21  JUL  1979. 

(b)  REENTRY  OF  SKYLAB  TOOK  PLACE  OVER  CANADA 
ON  21  JUL  1979. 


Table  3-4  (a)  Event  Record  for  Sentence  27(a). 


I  Event:  REENTRY  I 
I  Actions  REENTERED  I 
I  Object:  SATELLITE  I 
I  ...Equipments  SKYLAB  I 
I  ...Relatives  I 
I  Locations  THE  EARTH’S  ATMOSPHERE  OVER  CANADA  I 
I  DATA=  ON  12  JUL  1979  I 
+ - + 
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Table  3-4(b)  Event  Record  for  Sentence  27(b). 

+ - + 

I  I 

I  Event:  REENTRY  I 

I  Action=  REENTRY  I 

I  Object:  SATELLITE  I 

I  ...  Equipment=  SKYLAB  I 

I  ...Relative=  I 

I  Location=  OVER  CANADA  I 

I  Date=  ON  12  JUL  1979  I 

I  I 

+ - + 

To  achieve  this,  the  sentences  with  the  eventive  noun  as  subject  are  first  restructured 
in  the  ERL  component.  Thus,  in  (27b),  the  eventive  noun  IMPACT  is  recognized  as 
expressing  the  main  predicative  concept  and  its  parse  tree  is  restructured  to  resemble 
that  of  (a):  the  surface  verb  "occurred",  "took  place"  or  "was"  is  replaced  by  the  lexi¬ 
cal  entry  for  IMPACT. 

3.2.3.2.15  Functional  Synonyms.  In  each  domain  there  are  a  number  of  verbs  which  are 
used  interchangeably(  e.g.,  "launch"  and  "  fire";  "land"  and  "impact").  These  are 
treated  as  functional  synonyms,  i.e.,  they  are  marked  in  the  lexicon  as  members  of  the 
same  class  of  event  related  concepts.  Similarly,  all  terms  identifying  members  of  a  mis¬ 
sile  class  are  marked  in  the  lexicon  by  the  superordinate  term,  and  processed  by  the 
template,  in  this  case,  the  MISSILE  template. 

Marking  functional  synonyms  in  the  lexicon  as  members  of  the  same  class  of  concepts, 
allows  retrieval  in  terms  of  event  or  object  classes,  rather  than  requiring  the  user 
analyst  to  think  of  all  possible  words  for  the  members  of  the  event  or  object  classes. 

The  two  examples  below  illustrate  event  records  derived  from  two  sentences  using  the 
functional  synonyms  "launch"  and  "fire".  Note  that  both  sentences  are  interpreted  as 
"launch"  events.  The  particular  verb  used  is  shown  In  the  Action  slot. 
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Table  3-4  (a).  Launch  vs.  Fire 


- - + 

I  I 
I  *>>  THE  SKYLAB  ORBITAL  WORKSHOP,  I 
I  *A  CONVERTED  S-4B  THIRD  STAGE  FROM  A  SATURN-5  I 
I  * LAUNCH  VEHICLE,  WAS  LAUNCHED  FROM  THE  KENNEDY  I 
I  »SPACE  CENTER  AT  1330  HOURS  ON  14  MAY  1973.  | 

I  Event:  LAUNCH  I 
I  Action=  LAUNCHED  I 
I  Object:  SATELLITE  I 
I  . . . Equipment=  SKYLAB  ORBITAL  WORKSHOP  i 
I  . . .  Relative=  A  CONVERTED  S-4B  THIRD  STAGE  FROM  A  I 
l  SATURN- S  LAUNCH  VEHICLE  ) 

I  Launchsite=  FROM  THE  KENNEDY  SPACE  CENTER  I 
|  Location=  I 
I  Time=  AT  1330  I 
I  I 
+ - + 


Table  3-4  (b).  Launch  vs.  Fire 


- - - 

I  I 

I  *>>  THE  SKYLAB  ORBITAL  WORKSHOP,  A  CONVERTED  I 

I  #S-4B  THIRD  STAGE  FROM  A  SATURN-5  LAUNCH  VEHICLE,  I 

I  *WAS  FIRED  FROM  THE  KENNEDY  SPACE  CENTER  AT  I 

I  1330  HOURS  ON  14  MAY  1973.  I 

l  Event:  LAUNCH  I 

I  Action=  FIRED  I 

I  Object:  SATELLITE  I 

I  . . .  Equipment  =  A  CONVERTED  S-4B  THIRD  STAGE  FROM  A  I 

I  SATURN-5  LAUNCH  VEHICLE  I 

I  Launchsite=  FROM  KENNEDY  SPACE  CENTER  I 

I  Location=  I 

I  Time=  AT  1330  I 

- - + 

We  are  aware  that  paraphrase  rules  may  lead  to  enormous  difficulties  in  a  system  for 
general  English.  However,  in  our  restricted  task  domain,  the  problem  seems  manageable. 
We  also  are  aware  of  the  fact  that  there  will  always  be  the  possibility  of  a  paraphrase 
that  was  not  anticipated,  as  well  as  other  cases  when  the  complexity  of  the  experimen¬ 
tal  input  exceeds  the  current  capabilities  of  the  model  of  the  system.  This  is  one  of  the 


reasons  why  MATRES  I  is  interactive. 


3.2.3.3  The  Lexicon.  The  MATRES  II  lexicon  is  specifically  designed  to  support  the 
grammatical  analysis  procedure,  and  is  therefore  intended  as  an  economical  rather  than 
an  exhaustive  inventory  of  feature  descriptions.  It  consists  of  two  parts:  a  listing  of 
the  features  or  attributes  employed  by  the  system,  and  a  collection  of  lexical  entries  in 
the  form  of  static  declarations  of  lexical  items  and  their  attributes. 


The  attributes  fall  into  several  classes.  Examples  of  each  are  given  beiow. 
(i)  Major  Grammatical  Category  Specifications. 

ADVB  (adverb) 

ADJ  (adjective) 

ART  (article) 

CONJ  (conjunction) 

NUM  (number) 

N (noun) 

VB  (verb) 

(ii)  Examples  of  Lexical  Features: 

COPULA 

DIR  (directional) 

EVENTIVE  (marks  eventive  nouns) 

INTRANS  (marks  intransitive  verbs) 

LOC  (locational) 

MODAL 

PASTP  (past  particile) 

PRESP  (present  participle) 

ROBJ  ("raise-object") 

RSUBJ  ("raise-subject") 

SUBNUM  (subordination  number) 

TENSED  (marks  tensed  verbs) 

TRANS  (marks  transitive  verbs) 

1NMBR,  2NMBR,  etc.  (marks  one-digit,  two-digit,  etc.,  numbers 


(Mi)  Event-Related  Features: 


EVENTS 

- +  - 

1 

OBJECTS 

+  - 

1 

OTHER 

CONCEPTS  1 

ACTIVITY 

1 

FLIGHT  | 

ACRAFT 

I 

I 

INCL 

1 

(inclination)  1 

ARRIVE 

IMPACT  | 

MISSILE 

1 

REV 

(revolution)  1 

DEORBIT 

LAUNCH  | 

SATELLITE  I 

COMM 

(communication)  1 

DEPART 

PENETRATE  I 

BOOSTER 

1 

1 

DEPLOY 

REENTRY  I 

1 

1 

ENROUTE 

RETURN  I 

- +  . 

1 

+  - 

1 

An  excerpt  from  the  current  lexicon  is  given  below: 


::  DELTA-CLASS  [  ADJ  ] 

DEORBIT  [  N  EVENTIVE  DEORBIT  ]  [  VB  DEORBIT  ] 

::  DEORBITED  [  VB  TRANS  PASTP  DEORBIT  ] 

[  VB  TRANS  TENSED  DEORBIT  ] 

::  DEPARTED  [  VB  TRANS  PASTP  DEPART  ] 

[  VB  TRANS  TENSED  DEPART  ]  . ; 
s:  DEPLOYED  [  VB  TRANS  PASTP  DEPLOY  ] 

[  VB  TRANS  TENSED  DEPLOY  ] 

::  DEPLOYMENTS  [  N  EVENTIVE  ] 

::  DESTINATION  [  N  ] 

::  DIVISION  [  N  ] 

::  DJIBOUTI  [  N  LOC  ] 

: :  DOWNED  [  VB  TRANS  PASTP  ]  [  VB  TRANS  TENSED  ]  . ; 
: :  DOWNRANGE  [  N  LOC  ]  . ; 

::  DURING  [  PREP  EMOD  TYME  ] 


The  entire  lexicon,  comprising  lexical  entries  for  all  three  domains  studied  thus  far,  is 
contained  in  Appendix  B. 


4.0  IMPLEMENTATION 


4.1  Principles  of  Discourse  Processing 

From  the  theoretical  viewpoint,  the  primary  goal  of  discourse  processing  is  to  arrive  at 
the  total  information  content  of  a  text,  where  the  total  information  content  of  a  text  is 
taken  to  be  the  aggregate  of  all  the  information  communicated  by  that  text,  including 
that  which  is  made  explicit  and  that  which  can  be  inferred  from  the  meaning  of  the 
words  appearing  in  that  text  and  their  syntactic  and  semantic  interrelations. 

A  human  analyst,  however,  is  selective.  He  does  not  seek  to  extract  all  the  information 
a  text  may  contain,  but  only  that  which  is  needed  for  the  performance  of  his  task. 
When  reviewing  a  message,  the  human  analyst  uses  his  innate  knowledge  of  English 
grammar,  as  well  as  his  extra-linguistic  knowledge  of  entities  such  as  spacecraft,  time, 
location,  and  actions  —  including  all  the  relevant  concepts  which  can  be  attributed  to  or 
are  implied  by  such  entities  —  and  extracts  only  those  information  items  which  are 
relevant  and  useful  to  the  attainment  of  his  current  goal. 

To  distill  information  elements  from  a  text,  the  computer  must  in  some  sense  model  the 
cognitive  processes  of  the  analyst.  It  must  take  into  account  what  is  known  about 
human  linguistic  behavior  --  how  the  analyst  interprets  text,  how  he  makes  inferences, 
how  he  remembers,  and  how  he  communicates. 

Fundamental  to  the  approaches  taken  in  this  field  is  the  assumption  that  a  person 
interpreting  natural  language  text  uses  much  more  than  just  the  information  contained  in 
isolated  words  and  sentences.  The  meanings  which  people  attribute  to  words  and  sen¬ 
tences  vary  not  only  according  to  their  linguistic  context,  but  also  with  the  subject 
matter  being  discussed  and/or  the  situation  in  which  they  are  used.  A  word  or  phrase  i» 
interpreted  in  terms  of  the  total  context  in  which  it  occurs;  it  is  the  knowledge  or  the 


total  context  which  enables  a  person  to  understand  language,  and  this  knowledge 
Includes  all  our  knowledge  about  the  real  world. 

Many  of  the  subprocesses  involved  in  language  understanding  are  still  largely  unex¬ 
plored  and  it  is  therefore  not  possible  to  construct  a  comprehensive  model  of  language 
understanding.  For  example,  no  specification  for  the  many  and  complex  inferential 
processes  involved  in  language  understanding  can  be  given  at  this  stage,  although  the 
points  in  the  understanding  process  at  which  they  operate  can  be  stated. 

The  OSI  natural  language  processing  system  is  based  upon  a  process  model  of  text 
understanding  involving  four  sets  of  operations. 

First,  the  sentences  of  a  message  text  are  parsed  into  a  set  of  propositinal  structures. 
The  propositions  are  linked  by  various  semantic  relations  which  may  be  explicitly 
expressed  in  the  surface  structure  of  the  text,  or  inferred  during  the  interpretation  pro¬ 
cess  on  the  basis  of  contextual  and/or  real  world  knowledge. 

Second,  the  resulting  set  of  propositions  are  organized  into  higher-level  conceptual 
categories,  namely,  event  representations. 

A  third  set  of  operations  links  the  resulting  event  representations  into  a  coherent  whole, 
reflecting  the  meaning  of  the  message  text  as  u  whole. 

Finally,  when  all  m  ^ss-ges  constituting  an  EVENT  REPORT  are  processed  in  this  manner,  a 
set  of  constraints  checks  the  coherence  of  the  set  of  messages  constituting  on  EVENT 
REPORT  at  the  global  level,  i.e.,  at  the  level  of  the  EVENT  REPORT. 

4.2  The  MATRES  II  Text  Processing  System 

4. P.1  General  Remarks.  The  principles  of  message  text  processing  disc -seed  ahove  are 
partially  implemented  in  the  MATRES  II  text  processing  system. 
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MATRES  II  is  the  result  of  the  second  cycle  in  the  development  of  a  system  with  full 


capabilities  for  deriving  formatted  records  from  the  narrative  text  of  intelligence  mes¬ 
sages.  It  represents  a  considerable  advance  on  MATRES  I,  which  provided  only  a  rudi¬ 
mentary  capability  for  testing  algorithms  for  narrative  text  analysis. 

The  two  subject  domains  of  MATRES  II  are  air  activities  and  missile  and  satellite  launch¬ 
ings.  While  in  a  fully  developed  system  the  unit  of  analysis  would  be  the  entire  mes¬ 
sage,  the  scope  of  the  current  system  is  limited  to  the  analysis  of  single  declarative 
sentences. 

The  definition  of  the  input  language  accepted  by  the  system  is  embodied  in  a  transition 
network  grammar  model  based  upon  Woods  (1970,  1973).  The  MATRES  II  parser  has 
undergone  considerable  refinement  and  expansion  and  currently  accepts  a  much  wider 
range  of  syntactic  constructions  than  was  previously  achieved. 

In  the  current  version  of  the  system,  English  language  words  are  entered  into  a  linguistic 
dictionary,  while  strings  with  fixed  patterns  are  recognized  at  the  input  stage  by  a  finite 
state  automaton  (FSA)  designed  especially  for  this  purpose. 

The  FSA  recognizes  strings  denoting  Zulutimes  (e.g.,  1907Z),  geographic  coordinates 
(e.g.,  3674N4261E),  integers  (e.g. 15,  1978),  and  ordinals  (2nd,  3rd,  25th).  Such 
strings  are  tagged  with  appropriate  features  at  the  input  stage  and  added  to  the  inter¬ 
nal  lexicon. 

The  major  feature  of  MATRES  II  is  its  capability  for  semantic  analysis.  This  is  achieved 
by  means  of  the  Event  Representation  Language,  which  Is  a  language  specially 
developed  for  mapping  the  syntactic  structures  produced  by  the  parser  into  template- 
derived  content  representations.  The  basic  data  structure  of  the  Event  Representation 
Language  Is  the  template. 
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The  current  version  of  the  system  takes  single  sentences  as  input  and  transforms  each 
into  one  or  more  event  records  by  combining  a  "bottom-up",  data-driven  analysis  based 


upon  linguistic  and  logical  principles  with  a  “top-down",  conceptually  driven  domain- 
specific  interpretation  of  the  structures  generated  by  the  input  analysis.  The  "bottom- 
up"  analysis  is  effected  by  the  augmented  transition  network  (ATN)  parser,  which  util¬ 
izes  a  dictionary  and  a  grammar  of  the  reporting  language  to  produce  a  parse  tree  show¬ 
ing  the  syntactic  composition  of  the  input  string  and  the  hierarchical  relationships 
between  component  structures.  The  output  of  the  parser  is  input  to  the  ERL  "machine", 
which  uses  a  set  of  prestored  templates  for  the  interpretation  of  the  input,  and  pro¬ 
duces  event  records  as  output.  These  event  records  constitute  the  primary  elements  for 
the  construction  of  the  "extensional"  data  base,  whose  purpose  is  to  serve  as  a  support 
tool  for  higher-level  analytical  functions  in  a  decision-making  environment. 

Figure  4-1  illustrates  how  the  program  reorganizes  the  components  of  —  in  this  case  -- 
a  hypothetical  sentence  to  give  a  clearer  presentation  of  its  information  content. 

The  computer  program  which  embodies  this  approach  to  natural  language  understanding 
Is  written  in  FORTH,  Prolog,  and  SN0B0L4,  and  runs  on  a  PDP  1 1  /45  under  the  RSX  1 1 D 
operating  system.  A  flow  diagram  of  the  MATRES  II  system  is  shown  in  Figure  4-2. 
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Input:  Unstructured  Text 

+ - + 

I  The  three  unidentified  heavy  bombers  which  flew  I 

I  from  London  to  Cairo  on  the  30  Apr  1975,  refuelled  I 

I  in  Naples  at  approximately  1300  hours  the  same  day.  I 

+ - + 


Output:  Event  Record 

+ - + 

I  Event  Type:  REFUEL  I 

I  Verb  used:  REFUELLED  I 

I  Object:  I 

I  ...Equipment:  UNIDENTIFIED  HEAVY  BOMBERS  I 

I  ...Set  specification:  THREE  I 

I  ...Relative:  WHICH  FLEW  FROM  LONDON  TO  CAIRO  I 

I  ON  THE  30  APR  1975  I 

I  I 

I  Location:  IN  NAPLES  I 

|  Time:  AT  APPROXIMATELY  1300  HOURS  I 

|  Date:  THE  SAME  DAY  I 

+ - + 

Figure  4-1.  Input/Output  Representations 

The  major  part  of  the  system  was  built  in  the  programming  language  FORTH,  which  is  an 
interactive,  incremental  system  with  a  low-level  semantics  which  the  user  can  easily  and 
quickly  extend.  This  allowed  the  rapid  development  of  the  ATN  language  and  control 
scheme,  as  well  as  the  support  scheme  for  the  execution  of  the  Event  Representation 
Language  (ERL)  algorithms,  a  formal  language  written  for  the  purpose  of  analyzing  text. 
The  ERL  algorithms  are  written  in  Prolog,  a  language  that  is  well  suited  to  the  specifi¬ 
cation  of  templates  and  the  algorithms  for  instantiating  them  in  ERL.  For  ease  of  imple¬ 
mentation,  the  compiler  for  the  subset  of  Prolog  utilized  in  this  application  was  written  in 
SNOBOL  4. 


The  use  of  FORTH  and  the  Prolog  formalism  allowed  fairly  easy  development  of  the  sys¬ 
tem  even  without  the  powerful  structure  manipulation  capabilities  of  a  language  like 


SP 
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Toward  the  end  of  the  l&W  Itl  contract,  It  became  clear  that  the  combination  of  the 
grammar  and  template  code  compiled  from  ERL  would  take  up  almost  all  of  the  available 
space  in  the  FORTH  dictionary,  leaving  very  little  working  room  for  sentence  processing. 
In  fact,  it  was  necessary  to  partition  the  templates  Into  separate  files,  and  process  sen¬ 
tences  using  different  templates  in  different  runs  of  MATRES;  even  then,  only  short  sen¬ 
tences  could  be  processed. 

To  remedy  this  difficulty,  a  scheme  was  developed  to  "overlay"  the  code  for  the  gram¬ 
mar  and  templates,  so  that  they  would  each  occupy  the  same  space,  but  at  different 
times,  since  parsing  and  template  matching  do  not  overlap  in  time.  To  do  this,  it  was 
necessary  to  design  a  scheme  to  allow  a  portion  of  the  FORTH  dictionary  to  be  saved  on 
disk,  and  the  dictionary  to  be  truncated.  The  saved  segment  could  later  be  brought 
back  into  memory  at  the  same  locations,  and  linked  into  the  dictionary  in  the  same  way 
as  when  it  was  saved.  Several  segments  could  be  created  and  saved  in  this  way,  start¬ 
ing  at  the  same  location,  and  could  be  restored  to  memory  one  at  a  time  as  needed. 

Several  FORTH  Words  were  defined  to  manage  this  operation.  SEGBASE  defines  the 
starting  location  of  a  set  of  segments;  every  Word  defined  after  an  invocation  of  SEG¬ 
BASE  belongs  to  the  first  segment  until  an  invocation  of  SEGSAVE.  SEGSAVE  takes  as  a 
parameter  the  name  given  to  the  SEGBASE,  and  stores  the  segment  on  disk  starting  at 
the  block  number  given  by  the  variable  DYNBAS,  which  it  updates  to  the  next  available 
block  for  the  next  segment;  it  also  truncates  the  dictionary  at  the  SEGBASE. 
Thereafter,  Words  are  defined,  making  up  the  next  segment,  until  the  next  call  of  SEG¬ 
SAVE.  In  this  way,  several  segments  are  defined.  When  it  is  in  order  to  use  a  particular 
segment,  SEGLOAD  is  called  with  the  block  number  of  the  desired  segment  as  a  parame¬ 
ter;  it  loads  the  block  starting  just  after  the  SEGBASE,  making  the  Words  in  the  segment 


available. 


Using  those  commands,  MATRES  was  rebuilt  to  store  the  finite-state  automaton  for  lexi¬ 
cal  lookup  and  the  ATN  grammar  in  one  segment,  and  the  ERL  machine  and  the  template 
code  in  the  other.  A  sentence  is  processed  by  loading  the  parse  segment,  parsing  the 
sentence  to  create  the  parse  tree,  loading  the  ERL  and  template  segment,  then  matching 
the  templates  with  the  parse  tree.  This  resulted  in  a  very  slight  slowdown  from  the  ear¬ 
lier  scheme,  but  has  allowed  more  templates  of  greater  complexity  to  be  loaded,  and 
much  longer  sentences  to  be  processed. 

The  ultimate  test  of  a  computational  system  for  understanding  natural  language  is  its 
success  in  performing  some  specific  task.  The  goals  set  out  for  the  current  project 
have  to  a  large  extent  been  met. 

4.2.2  Functional  Description.  An  overview  of  the  MATRES  II  system  organization  and 
data  flow  is  shown  in  Figure  4-2.  The  main  system  components  are:  the  Lexical  Unit 
Recognizer,  the  ATN  parser,  and  the  ERL  "machine".  The  direction  of  the  arrows  in  Fig¬ 
ure  4-2  indicates  the  general  flow  of  information  as  a  sentence  is  processed  through 
the  system.  The  main  stages  of  event  record  generation  are  shown  across  the  center 
of  the  figure.  Feeding  into  this  are  the  various  analysis  components,  each  compiled  from 
a  source  text  in  a  language  appropriate  to  the  component. 
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.  MATRES  II  System  Overview 


4.2.3  The  Event  Representation  Language  (ERL).  The  Event  Representation  Language  is 
a  formalism  which  was  developed  for  the  purpose  of  exploring  the  effectiveness  of 
"templates"  as  a  knowledge  representation  technique  with  which  to  build  systems  for 
message  text  analysis  in  support  of  l&W  functions. 

The  main  function  of  this  language  is  to  guide  the  mapping  process  which  converts  nar¬ 
rative  text  into  formatted  event  records. 

The  basic  data  objects  of  the  Event  Representation  Language  are  the  templates.  As 
mentioned  in  Subsection  2.2.3,  we  conceive  of  templates  as  active  memory  structures' 
which  embody  hypotheses  about  objects,  facts,  events,  processes,  operations,  pro¬ 
cedures  and  computations  required  to  characterize  the  links  between  form  and  content. 

The  Event  Representation  Language  is  implemented  in  a  subset  of  Prolog,  a  formalism 
using  a  clausal  form  of  logic  restricted  to  "Horn"  clauses  (Warren  et  a/.,  1977;  Pereira 
et  al.,  1978).  Horn  clauses  may  be  given  both  a  declarative  and  a  procedural  interpre¬ 
tation  and  are  therefore  well  suited  for  the  expression  of  concepts  in  ERL.  The  basic 
computational  mechanism  of  Prolog  is  a  pattern  matching  process  ("unification")  operat¬ 
ing  on  general  record  structures  ("terms"  of  logic). 

Prolog  was  initially  developed  at  the  University  of  Marseilles  (Roussel  1975)  as  ’  practi¬ 
cal  tool  for  ’logic  programming’  (Kowalski  1974;  Coimerauer  1975;  van  Emden  1975),  and 
has  since  been  used  in  several  other  centers  (Stanford,  Edinborough)  for  writing 
language  analysis  systems  (Dahl  1977;  Warren  1977a,  Warren  1977b). 

Prolog  is  a  perspicuous  and  powerful  language  for  the  expression  of  the  concepts  of  the 
Event  Representation  Language,  and  admits  of  an  effective  and  reasonably  efficient 
Implementation.  Clear,  readable,  concise  programs  can  be  written  quickly  and  with  few 
errors.  Specifically,  the  following  features  make  it  particularly  suitable  for  our  purposes: 
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a.  Pattern  matching  (unification)  replaces  the  conventional  use  of  selector  and  con- 
t  structor  functions  for  operating  on  structured  data. 

b.  The  arguments  of  a  procedure  can  serve,  not  only  for  it  to  receive  one  or  more 
values  as  input,  but  also  for  it  to  return  one  or  more  values  as  output.  Procedures 
can  thus  be  "multi-output"  as  well  as  "multi-input". 

c.  The  input  and  output  arguments  of  a  procedure  do  not  have  to  be  distinguished  in 
advance,  but  may  vary  from  one  call  to  another.  Procedures  can  thus  be  "multi¬ 
purpose". 

d.  Procedures  may  generate  (via  backtracking,  in  the  case  of  Prolog)  a  set  of  alterna¬ 
tive  results.  Such  procedures  are  called  "non-determinate".  Backtracking  amounts 
to  a  high-level  form  of  iteration. 

e.  Procedures  may  return  "incomplete"  results,  i.e.,  the  term  or  terms  returned  as  the 
result  of  a  procedure  may  contain  variables,  which  are  only  filled  in  later  by  calls  to 
other  procedures.  The  effect  is  similar  to  the  use  of  assignment  in  a  conventional 
language  to  fill  in  fields  of  a  data  structure.  Note,  however,  that  there  may  be  many 
occurrences  of  an  instantiated  variable,  and  that  all  of  these  get  filled  in  simultane¬ 
ously  (in  a  single  step)  when  the  variable  is  finally  instantiated.  Note  also  that 
when  two  variables  are  unified  together,  they  become  identified  as  one.  The  effect 
is  as  though  an  invisible  pointer,  or  reference,  linked  one  variable  to  the  other.  We 
refer  to  these  related  phenomena  as  the  "logical  variable". 

f.  "Program"  and  "data"  are  identical  in  form.  A  procedure  consisting  solely  of  unit 
clauses  is  closer  to  an  array,  or  table  of  data,  in  a  conventional  language. 


4.2.4  The  ERL  Control  Mechanism.  Prolog  provides  a  remarkably  simple  form  of  control, 
which  suffices  for  many  practical  applications. 

The  declarative  semantics  of  Prolog  clauses  Is  such  that  the  order  of  the  goals  in  a 
clause  and  the  order  of  the  clauses  themselves  are  both  irrelevant  to  the  declarative 
interpretation.  However,  these  orderings  are'generally  significant  in  Prolog,  as  they  con¬ 
stitute  the  main  control  information. 

When  the  Prolog  system  is  executing  a  procedure  call,  the  clause  ordering  determines 
the  order  in  which  the  different  entry  points  of  the  procedure  are  tried.  The  goal  order¬ 
ing  fixes  the  order  in  which  the  procedure  calls  in  a  clause  are  executed.  The  ’produc¬ 
tive’  effect  of  a  Prolog  computation  arises  from  the  process  of  ’matching’  a  procedure 
call  against  a  procedure  entry  point. 

Prolog  has  captured  the  imagination  of  many  workers  in  natural  language  processing  and 
advanced  data  base  management,  and  promises  to  be  one  of  the  major  languages  of  the 
future. 

4.2.5  Advantages  of  Prolog  Representation.  This  representation  has  several  advan¬ 
tages,  among  which  we  might  mention  the  following  two:  (1)  if  additional  information 
needs  to  be  associated  with  a  particular  predicate,  this  can  be  done  simply  by  adding 
another  clause;  and  (2),  Prolog  provides  a  uniform  way  of  representing  structures  and 
processes  at  several  levels  of  grammatical  description:  syntactic  structures,  syntactic 
normalization,  description  of  objects,  description  of  events,  and  description  of  text-level 
relations. 

Recent  investigations  reported  in  the  literature  show  that  Prolog  Is  not  only  used  for 
grammatical  description  of  structures  and  processes  of  natural  language,  but  can  also  be 
used  as  a  practical  tool  and  unifying  principle  for  the  description  and  manipulation  of 


data  bases. 


4.3  Illustration  of  the  Understanding  Process 

The  next  three  subsections  give  examples  of  the  various  procedural  steps  involved  in 
the  processing  of  narrative  text,  and  show  how  the  goals  set  out  in  Section  1  are 
approximated. 

4.3.1  inputting  a  Sen*ence.  The  understanding  process  begins  when  a  sentence  is  input 
to  the  system.  This  is  currently  done  either  from  a  terminal,  or  from  a  disc  file.  A  sen¬ 
tence  is  input  in  the  format  ’>>  Sentence.’.  This  is  illustrated  below: 

(a)  »  THE  SKYLAB  ORBITAL  WORKSHOP, 

A  CONVERTED  S-4B  THIRD  STAGE  FROM  A  SATURN-5 
LAUNCH  VEHICLE,  SUCCESSFULLY  DEORBITED  INTO  THE 
AUSTRALIAN  OUTBACK  ON  1 2  JUL  1 979. 

The  input  sentence  is  received  by  the  Lexical  Unit  Recognizer,  which  uses  a  stored  dic¬ 
tionary  and  the  FSA  Recognizer  to  transform  the  individual  words  of  an  input  sentence 
into  a  string  of  lexical  units.  First,  a  dictionary  look-up  process  replaces  words  and 
phrases  in  the  sentence  with  corresponding  lexical  entries. 

Table  4-3  shows  the  lexical  entries  for  the  words  "deorbited",  "Australian",  "outback", 
and  "launched".  The  entire  lexicon,  comprising  lexical  entries  for  all  three  domains,  is 


contained  in  Appendix  B. 


Table  4-3.  Sample  Lexical  Entries 


r 


\ 


+ - + 

I  I 

I  ::  DEORBITED  [  VB  TRANS  PASTP  DEORBIT  ]  | 

I  [  VB  TRANS  TENSED  DEORBIT  ]  | 

I  I 

I  ::  AUSTRALIAN  t  ADJ  NATION  ]  I 


I  ::  OUTBACK  [  N  LOC  ]  . ;  I 

I  I 

I  ::  (  LAUNCH  VEHICLE)  [  N  BOOSTER  }  .;  I 

+ - + 


Strings  which  have  no  entries  in  the  dictionary  are  passed  to  the  FSA  Recognizer,  which 
attempts  to  identify  them  as  one  of  several  fixed-pattern  categories,  (numbers,  Zulu- 
times,  geographic  coordinates).  In  the  case  of  input  sentence  (a),  the  strings  ’12’  and 
’1979’  are  recognized  as  numbers  and  tagged  with  the  features  ’N’  for  noun,  and  2DIG 
and  4DIG,  indicating  that  they  are  two-digit  and  four-digit  numbers  respectively. 

4.3.2  Parsing  a  Sentence.  One  of  the  major  problems  in  constructing  automated 
language  understanding  systems  is  that  of  transforming  the  input  string  of  words  into  a 
canonical  form  which  permits  semantic  interpretation.  Part  of  the  transformation  process 
involves  syntactic  analysis.  The  main  purpose  of  automated  syntactic  analysis  is  that  of 
determining  the  logical  structure  of  input  sentences  (or  larger  text  units).  In  the  course 
of  constructing  a  structural  description,  a  syntactic  analyzer  generally  ’regularizes’  sen¬ 
tence  structure,  e.g.,  it  converts  sentences  and  parts  of  sentences  which  have  proposi¬ 
tional  structure  into  a  canonical  form.  For  example,  in  the  current  version  of  the  system 
passive  sentences  are  converted  into  their  active  form  by  the  syntactic  component. 

Automated  syntactic  analysis  involves  the  operation  of  parsing*.  The  current  version  of 

the  OSI  system  employs  an  Augmented  Transition  Network  parser  (sentence*  acceptor), 

*  A  parser  is  a  formal  algorithm  which  fulfills  two  functions.  One,  it 
takes  a  grammar  and  a  lexicon  and  decides  whether  a  sequence  of  words 
is  a  sentence  with  respect  to  that  grammar  and  second,  it  builds  a 
structural  representation  for  that  sequence  of  words. 
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and  is  designed  to  accept  messages  from  the  aircraft  activities  and  missile  and  satellite 
domains.  It  produces  a  propositional  representation  which  is  fairly  close  to  surface 
structure. 


Let  us  now  return  to  the  input  sentence  under  discussion.  After  lexical  lookup,  the 
string  is  input  to  the  augmented  transition  network  (ATN)  parser,  which  analyzes  it 
according  to  the  sublanguage  grammar  stored  in  the  system.  Roughly  speaking,  the  ATN 
processor  takes  the  string  of  lexical  entries  derived  from  the  sentence,  combines  them 
into  phrases,  and  determines  the  logical  relationships  that  hold  among  them.  In  this 
phase  of  processing  the  focus  is  on  comparatively  local  and  superficial  aspects  of  sen¬ 
tences  such  as  word  order,  and  the  invariant  properties  of  words  stored  in  the  lexicon. 
The  internal  representation  of  the  parse  tree  for  sentence  (a)  is  shown  in  Table  4-4. 
Table  4-4.  Parse  Tree  For  Sentence  (a) 


PARSE  output: 

LIST  of: 

!  node:  1 ! s 
i  :  list  of: 
i  :  node:  2 :  pp 
III!  node:  4 ! DATE 
!  !  !  :  :  652.,  1979 

:  !  :  !  !  630..  JUL 

!!l!!  60S..  .1.2 
:  !  i  :  END  NODE 

III!  586,,  ON 

i  :  i  i  list  of: 

!  :  !  :  END  LIST 

!  I  !  END  NODE 

:  ;  i  node:  2 : pp 

i  :  i  :  node:  2:np 

:  i  :  i  i  list  of: 

;  :  i  ;  :  end  list 

:  :  :  :  i  node:  5 innod 
1  :  :  1  ;  1  «nil» 

III!!:  564..  OUTBACK 
I  I  I  I  I  END  NODE 

I  III!  LIST  of: 

I  I  I  I  I  I  542..  AUSTRALIAN 
I  I  I  I  I  END  LIST 

1  1  1  I  I  node:  2 1  DP 

:  1  I  1  I  I  list  of: 


4 


:  :  :  :  52o..  the 
:  :  :  :  list  of: 

III!  END  LIST 
!  I  I  END  NODE 
I  END  NODE 
:  :  498 » .  INTO 

I  I  LIST  of: 

:  !  END  LIST 

I  END  NODE 
END  LIST 
NIL  :■•> 

NIL  > 

NODE:  2100 
I  404 . .  D EOF BI  T  ED 

:  «nil» 

I  LIST  of: 

I  END  LIST 

:  list  of: 

I  I  432.,  SUCCESSFULLY 
I  END  LIST 
END  NODE 
NODE :  2 : NP 
I  LIST  of: 

I  I  LIST  of: 
i  i  :  node:  i is 
iii:  list  of: 

I  I  I  I  END  LIST 
I  I  I  I  «NIL» 

I  I  I  I  NODE:  2 1 NP 
I  I  I  I  I  LIST  of: 
iii::':  node:  2ipp 
i  :  :  i  i  :  ;  node:  2 if 


;  node:  2 I NP 
:  :  list  of: 

I  I  END  LIST 

i  :  node:  5 innod 

I  I  I  «NIL.» 

I  I  I  410..  LAUNCH  VEHIC 
I  I  END  NODE 
I  I  LIST  OF? 

I  I  I  388..  SATURN-5 
I  I  END  LIST 
I  I  NODE?  2  I  DP 
I  I  I  LIST  of: 

I  I  !  END  LIST 
I  I  I  366..  A 
I  I  I  LIST  of: 

I  I  I  END  LIST 
I  I  END  NODE 
I  END  NODE 
I  344..  FROM 
I  LIST  of: 

I  END  LIST 
END  NODE 
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:  :  t  :  i  :  i  end  list 
i  i  i  i  :  :  :  node:  5  innod 
::::::::  «nil» 

:  i  :  :  :  :  :  :  322..  stage 

:::::::  end  mode 
i  :  i  ;  :  :  :  list  of: 

:  :  :  i  :  :  :  :  300 , ,  third 

::::::::  278.,  s~4B 

256.,  converted 
::::::  :  end  list 
:  :  1  :  :  :  :  node:  2: dp 

::::::::  list  of? 

end  list 

::::::::  234 , ,  a 
::::::::  list  of: 

end  list 

!!!!!!!  END  NODE 

::::::  end  node 
node:  2 1 vg 

:::::::  <<n i l>> 
:::::::  «n il» 
list  of: 
:::::::  end  list 
list  of: 
:::::::  end  list 
end  node 
::::::  «nil.» 

::::::  node:  4 : actv 
:::::::  «nil..» 
::::::  end  node 
:  :  :  :  :  end  node 
:  :  :  end  list 

i  :  1  end  list 
:  i  :  node:  s innod 
:  :  :  :  «nil» 

:  i  :  i  2 :l 2 , .  orbital  workshop 
:  :  :  end  node 
i  !  :  list  of: 

:  :  :  :  i?o..  skylab 
:  :  :  end  list 
:  :  :  node:  2: dp 
:  :  :  :  list  of: 

:  :  :  :  end  list 
:  :  :  i  168..  the 
i  i  :  :  list  of: 

:  :  :  :  end  list 

!  !  !  END  NODE 

:  :  END  NODE 

:  :  node:  4:actv 
:  :  ;  «n il.» 

!  1  END  NODE 

I  END  NODE 
END  LIST 


4.3.3  Interpreting  the  Parse  Tree.  A  parser  is  a  fairly  complex  mechanism,  and  it  is 
therefore  unwise  to  burden  it  with  all  the  operations  required  during  the  syntactic 
analysis  phrase.  Some  of  the  more  complex  regularizing  functions  are  therefore 
offloaded  onto  the  ERL  component. 

Syntactic  normalizing  procedures  convert  intermediate  tree  structures  generated  by  the 
syntactic  processor  into  deep  structure  trees  for  logical  semantic  analysis  and  event 
record  synthesis. 

The  processes  relevant  here  include  filling  in  elements  which  are  missing  from  the  sur¬ 
face  structure,  resolving  syntactic  ambiguities,  replacing  moninalizations  with  their 
corresponding  verbal  constructions,  and  generally  rearranging  the  elements  in  a  sen¬ 
tence  to  regularize  its  structure. 

In  the  current  version  of  the  system,  all  but  the  passive  restructuring  rule  are  incor¬ 
porated  into  the  ERL  formalism,  and  are  intermingled  with  the  other  interpretive  rules. 

The  input  sentence  under  discussion  does  not  require  restructuring. 

As  explained  in  a  previous  section,  the  ERL  semantic  interpretation  rules  (clauses)  are 
used  top-down,  one  at  a  time.  Goals  in  a  clause  are  executed  from  left  to  right.  If  there 
are  alternative  clauses  at  any  point,  backtracking  will  return  to  them. 

The  parse  tree  shown  in  Table  4-4  is  input  to  the  ERL  "machine",  which  uses  the  pattern 
matching  process  ("unification  mechanism  ’)  of  the  Event  Representation  Language  to 
produce  a  set  of  one  or  more  event  records  representing  the  information  content  of  the 
Input  sentence. 

As  a  first  step  in  the  interpretation  process,  the  system  activates  the  system  generated 
goal  ’do’,  which  is  currently  the  top-levei  procedure. 
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The  Top-Level  Procedure 


+ - ^ 

I  do ( [X, Y, Z] ): -  build_ER (X, Y, Z, ER) ,  type_ER  (ER) .  I 

I  do([Tree]):_  build_ER  (Tree,  ER) ,  type_ER(ER).  I 

I  do([Tree]):_  build_ERl  (Tree,  ER) ,  type„ER  (ER) .  I 

I  I 

+ - H 

The  input  structure  unifies  with  the  head  of  the  third  clause,  giving  rise  to  two  subgoals: 
the  buiid_ER1  procedure,  and  the  type_ER  procedure.  The  ’build_ER1  procedure  is 
illustrated  below: 

The  ‘build_ER1’  Procedure  for  Simple  Sentences 


+■ - - - -i 

I  build_ERl  (s (Voice,  Subj 1, Vbgr 1,  Obj,  Compl,  Vmods) ,  temp (Name,  ER) ):  -  I 


I  changel  (Subj 1, Subj 2,  Vbgr 1,  Vbgr2) ,  I 

I  f Ind_t_name  (Vbgr 2, Name) ,  I 

I  construct  (Name, s  (Voice,  Subj 2,  Vbgr2,  Obj,  Compl,  Vmods) ,  ER) .  I 

I  I 

+ - 

Since  the  input  structure  under  discussion  does  not  require  any  restructuring,  ’change  1’ 
leaves  it  unchanged.  Next,  ’find_t_name’  identifies  the  name  of  the  template  which  is 
to  be  used  for  interpreting  the  current  input.  The  relevant  information  is  found  in  the 
main  verb  of  the  sentence.  Thus,  sentence  (a),  whose  main  verb  is  "deorbit",  will  cause 
the  DEORBIT  template  to  be  activated. 


The  DEORBIT  template  as  coded  in  Prolog  is  shown  below: 


DEORBIT  Template 


I  construct  ('DEORBIT ' , s  (Voice, Subj,  Vbgr,  Ob j,  Compl ,  Vmods) , 
I  [Verb,  AG, OBI, Loc,Rev,  DTGJ):  - 

I  verb  (Vbgr,  Verb) , 

I  agent  (Subj, Vmods,  AG) , 

I  object  (Subj, Obj, OBI) , 

i  location  (Obj, Vmods,  Loc) , 

I  revolution  (Vmods,  Rev) , 

I  construct  (' DTG', Vmods,  DTG) . 
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All  templates  are  encoded  as  Prolog  "construct"  clauses.  The  head  of  a  "construct" 
clause  has  three  arguments:  a  template  name,  the  name  of  the  syntactic  constituent 
which  serves  as  the  context  which  is  searched  in  an  attempt  to  find  fillers  for  the 
descriptor  slots  of  the  template  in  question,  and  a  third  argument  which  represents  the 
output  of  the  procedure,  i.e.,  the  instantiated  slots. 

The  body  of  a  ’construct’  clause  consists  of  several  ’goals’,  corresponding  to  the  ’slots’ 
of  a  template.  In  the  case  of  the  DEORBIT  template,  the  body  of  the  "construct"  clause 
consists  of  six  "goals",  each  defined  as  a  procedure  which  actively  seeks  suitable  fill¬ 
ers  for  the  descriptor  slot  it  represents. 

In  the  case  of  the  example  input  sentence,  each  of  the  six  "slots"  actively  searches 
the  specified  context  in  an  attempt  to  find  a  component  which  can  serve  as  a  "filler". 
Since  there  is  no  Agent  specified  in  the  input,  the  Agent  slot  returns  ’nil’.  Next,  the 
‘object’  slot  constructs  a  record  for  the  subject  nounphrase,  which  It  decomposes  into 
an  ’Equipment’  component,  and  a  ’Relative’  component. 

The  location  clause  illustrated  below  identifies  the  prepositional  phrase  "INTO  THE  AUS¬ 
TRALIAN  OUTBACK",  as  the  location  of  the  deorbit. 

The  ’revolution’  procedure  also  fails  to  find  a  filler,  and  returns  ’nil’.  The  DTG  procedure 
cannot  find  a  filler  for  Its  Time  component,  but  identifies  the  prepositional  phrase  ON  1 2 
JUL  1 979  as  the  deorbit  date. 

The  result  of  the  Interpretive  process  is  the  "instantiated"  template  —  or  Event  Record 
—  shown  In  Table  4-5. 
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The  ’Location’  Clause 


+ - + 

I  location (NP,  List,  slot ('Locations', X)) : -  I 

I  locatl  (NP,  XI),  I 

|  searchlist (List,  X2) ,  I 

|  concatenate  (XI,  X 2,  X) .  I 

I  locatl  (NP,  [NP]):-  test_nhead(NP, 'LOC').  I 

I  locatl (_,  nil).  I 

I  searchlist  ( [M, ..  List] ,  [X, .. L] ): -  I 

|  searchloc (M,  X) ,  searchlist (List,  L) .  I 

I  searchlist ([_,  ..List],  L) :  -  searchlist (List, L) .  I 

I  searchlist (_, nil).  I 

I  searchloc (pp  (LI,  Prep,  NP) ,  [LI,  Prep,  NP] ): -  I 

I  member (P,  ['ALONG',  'AT',  'EAST  OF',  'IN',  'INTO',  I 

I  'NEAR', 'ON', 'SOUTHEAST  OF',  'OUTSIDE  OF’,  I 

|  'WEST  OF’]),  I 

I  lexeq  (Prep, P) ,  test_nhead  (NP, ' LOC') .  I 

I  I 

+ - + 


Table  4-5.  Event  Record  for  Sentence  (a). 

+ - + 

I  Event:  DEORBIT  I 

|  Actions  SUCCESSFULLY  DEORBITED  I 

|  Object:  SATELLITE  I 

I  . . . Equipments  SKYLAB  ORBITAL  WORKSHOP  I 

I  ...Numbers  I 

I  ...Relatives  A  CONVERTED  S-4B  THIRD  STAGE  FROM  A  SATURN-5  LAUNCH  I 

I  VEHICLE  I 

I  Locations  INTO  THE  AUSTRALIAN  OUTBACK  I 

I  Date=  ON  12  JUL  1979  I 

+ - + 

As  mentioned  before,  Event  Records  are  template-derived,  event-centered  data  struc¬ 
tures  in  which  the  information  conveyed  by  the  input  can  be  viewed  from  the  perspec¬ 
tive  of  time,  location,  type  of  activity,  object(s)  involved,  etc.  They  provide  content 
representations  for  individual  sentences  describing  atomic  events,  and  form  the  building 
blocks  of  message  content  representations  which  will  eventually  provide  answers  to 
Question  1  of  Section  1 :  "What  is  the  content  of  the  Message?" 


4.3.4  Identifying  the  Reported  Source  of  an  Event  Report.  Certain  events  are  officially 
announced.  Thus  a  Soviet  Satellite  launching  may  be  announced  as  follows: 
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(b)  THE  SOVIET  NEWS  AGENCY  TASS  ANNOUNCED 
THAT  IMPACT  OF  COSMOS-954  TOOK  PLACE 
NEAR  YELLOWKNIFE,  CANADA. 


The  system  as  developed  to  date  identifies  the  source  of  this  report  and  interpret  TASS 
as  the  "Infosource"  of  the  launch  event.  Sentence  (b)  yields  the  event  record  shown  in 
Table  4-6. 

Table  4-6.  Event  Record  for  Sentence  (b) 

+ - + 

I Infosource*  THE  SOVIET  NEWS  AGENCY  TASS  I 

I  Event:  IMPACT  I 

I Action=  IMPACT  I 

lObJect:  SATELLITE  I 

I  ...  Equipment  =.  COSMOS-954  I 

I  ...  Number^  I 

I  ...Relatives  I 

I  Locations  NEAR  YELLOWKNIFE  CANADA  I 

I  I 

+ - + 

4.3.5  Identifying  the  Reported  Status  of  an  Event.  Some  events  are  reported  as 
expected,  or  as  having  failed  in  some  sense.  The  following  two  examples  illustrate  how 
the  current  version  of  the  system  processes  sentences  expressing  these  notions. 


(c)  THE  ORBITAL  WORKSHOP  SKYLAB  WAS 
EXPECTED  TO  DEORBIT  OVER  CANADA. 


Table  4-7.  Event  Record  for  Sentence  (c) 

+ - + 

IStatuss  EXPECED  I 

I  Event:  DEORBIT  I 

I  Actions  DEORBIT  I 

lObject:  SATELLITE  I 


I  ...Equipment  =  ORBITAL  WORKSHOP  SKYLAB  I 
I  ...Numbers  | 

I  ...Relatives  I 
(Locations  OVER  CANADA  I 
I  I 
+ - + 
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(d)  THE  ORBITAL  WORKSHOP  SKYLAB  FAILED 
TO  DEORBIT  INTO  CANADA 


Table  4-8.  Event  Record  for  Sentence  (d) 

+ - + 

|Status=FAILED  I 

I  Event:  DEORBIT  I 

I Action=  DEORBIT  I 

lObject:  SATELLITE  I 

I  ...Equipment  =  ORBITAL  WORSHOP  SKYLAB  I 


I  ...Number*  I 
I  ...Relative*  I 
I  Location*  INTO  CANADA  I 
I  I 
+ - + 


As  pointed  out  in  the  section  on  the  Characteristics  of  Messages,  the  key  event 
described  in  a  message  is  usually  mentioned  in  the  first  sentence.  This  sentence  intro¬ 
duces  the  TOPIC  of  the  message,  i.e.,  what  the  message  is  about. 


In  order  to  test  and  evaluate  the  capabilities  of  the  system,  a  set  of  sentences  were 
constructed  modeled  on  the  syntactic  properties  of  first  sentences  of  the  various  mes¬ 
sage  types  identified  in  the  missile  and  satellite  domain.  The  sentences  were  so 
designed  as  to  test  various  aspects  of  the  syntactic  structure  of  the  sublanguage. 
Examples  of  sentence  types  currently  processed  by  the  system  together  with  their 
corresponding  event  records  are  offered  In  Appendix  (E). 
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SPANAME  CONFIRMED  IN  ORBIT  ?  2MMBR  MNTH  4NMBRT$ 

SPANAME  HAS  ACHIEVED  ORBIT** 

I  HIS  SATELLITE  IS  A  FIRST  GENERATION  LOW  RESOLUTION  PHOTGRAPHIC  SATELLIT 
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3.  SPANAME  WAS  LAUNCHED  FROM  PLACENAME  AT  ZULUTIME  TODAY$$ 
PRESSNAME  HAS  Nt)T  YET  ANNOUNCED  SPANAME' S  ORBITAL  PARAMETERS$$ 
$$$  MSG  5 


COMPLEX  BY  THE  LSTYPE  SPACE  BOOSTER  AT  ZULUTIME  THIS  DATE  HAS  BEEN 
IDENTIFIED  AS  AN  UNIDENTIFIED  MILITARY  SUPPORT  SATELLITE  (LSTYPE)  OF 
THE  SPATYPE  3NMBR»  3NMBR »  3NMBR  CLASS »  WHICH  WERE  FRAGMENT 
DISPENSING  SATELLITES$$ 

$$$  MSG  ? 


PRESSNAME  ANNOUNCES  LAUNCH  OF  SPANAME$$ 

THE  POL I TAD J  NEWS  AGENCY ,  PRESSNAME,  ANNOUNCED  THE  LAUNCH  OF  SPANAMI 
(SPANAME)  IN  A  ROUTINELY  WORDED  STATEMENT  ON  1NMBR  MNTH  4NMBR$$ 


o 

3  UJ 

) — 

UJ 

3 

-J 

z 

3 

3  3 

3 

Z 

Z 

Cl 

3 

z 

X 

»-  3  3  (Si 

3 

3  3 

cd 

a 

UI 

3 

3 

<C 

Z  3 

Z 

o 

3 

3 

CD 

3 

3 

C  Z  3  <r  3 

X 

Z 

X 

z 

z 

N 

i— 

3  <r  » 

3 

x 

CJ 

1 — 

CD 

<£ 

3 

X 

3  X  (Si  (Si 

O  X 

3 

<c 

1-1 

Cl 

UJ 

HH 

CJ  w 

CD 

3 

<r 

3 

3 

<r 

3  X  1  3 

3 

3  Ol 

3 

x 

3 

UI 

X 

3 

|X| 

CD  3  3 

1 

CD 

3 

X 

CD 

3 

X 

3 

tfi- 

X  Z  3  X 

Z 

1-1  Z 

CD 

ZD 

I 

3 

i—i 

Li 

'  fl  I- 

3 

3 

CD 

X 

3 

CJ 

T 

4*  h~ 

X  X  Z  Z 

3 

z  z 

Ui 

f— 1 

_l 

cj 

UI 

CD 

o 

3  Z  3 

z 

3 

3 

3 

3 

3 

3 

3  3 

(Si  <t  <x  o 

z 

CJ 

I 

<x 

z 

3 

CD 

1-  3  Z 

o 

3 

Z 

X 

4* 

<r 

z 

CJ 

3  <r 

3  Z  CJ 

UJ 

X 

3 

z 

N 

3 

<r 

3 

A 

3  CO 

CJ 

04 

o 

z 

X 

<c 

Z 

O  3 

3  3 

cj 

fli  z 

z 

<C 

z 

Z 

LlI 

3X3 

3 

3 

X 

Z 

z 

<r 

3 

X 

3 

1  U 

X  X  Oi  Z 

z 

z  o 

X 

o 

l— 

a  _i 

3 

j-h 

3  DC 

3  CD 

CD 

3 

O  O 

(Si 

3  3 

<t  <X  Z  3 

X 

Z  1-4 

3 

CJ 

<1 

t*4. 

X 

UI 

2: 

3  <L  O 

3 

CD 

3 

PH 

X 

o 

3 

3  CJ 

3  3  Z  O 

o 

CO  3 

Ui 

CD 

o 

z 

UJ 

l-ZI- 

3 

O 

3 

CD 

h- 

X 

3 

3 

Z 

3  <X 

X  3 

z 

<x 

UI 

X 

X 

CD  CD 

X 

<r 

K 

<r  3 

CD  3 

4* 

3 

3 

CD 

u 

3 

X 

<E 

3  3 

3X3 

z 

UJ  z 

z 

UI 

01 

<x 

X 

z 

z: 

CD  Z  Q 

CD  Ot 

4* 

X 

3  Z 

3 

3 

3 

Z 

X 

3  Z  3  3 

<x 

UJ  1-1 

<t 

o 

_l 

XL 

X 

u 

PH 

3  3 

Z  3 

i-  <r 

CJ 

Z 

X 

Z 

3 

3 

3 

3  <r  X  Z 

C D  _J 

z 

3 

Cl 

cj 

<r 

cj 

CD  3  3 

CD 

1 — J 

z 

3 

3 

3 

CD 

3 

3  (Si 

X  Z  3 

3 

O  O 

c 

o 

z 

& 

<c 

4* 

<E 

UJ 

3  3  3 

»  CD 

Oi 

X 

3 

3 

3 

3 

X 

3  3  3 

3  <r  3 

I 

0.  Z 

Cl 

I 

o 

4* 

Cl 

4* 

i'¬ 

3 

x 

x  cic  <r 

*  O  3  X 

♦ 

<r 

X 

<r 

3 

<r 

X 

o 

3  X 

X  3  Z  X  Z 

l — 

<E  i-t 

X 

D, 

a 

4* 

CD 

ll 

LL 

l— 

3  Ti  li 

W  3 

D>  O  3 

3 

i— 

z 

X 

3 

1 — 

u_ 

<C  3 

3X333 

A-4 


i 


RIGHT  ASCENSION  BY  2NMBR  DEGREES$$ 


2.  THE  LAUNCH  TIME  FOR  THE  ILL-FATED  SPACECRAFT  RESULTED  IN  A  RIGHT 
ASCENSION  OF  ABOUT  3NMBR  DEGREES r  WHICH  IS  2NMBR  DEGREES  FROM  THE 
EXISTING  NETWORK  AND  IS  ALMOST  3NMBR  DEGREES  OUT-OF-PHASE  WITH 
SPANAME$$ 
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APPENDIX  B:  Lexicon 
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LEXI CON 
FEATURE  1DIG 
FEATURE  2DIG 
FEATURE  3DIG 
FEATURE  ADI G 
FEATURE  5DIG 
FEATURE  6DIG 
FEATURE  ACRAFT 
FEATURE  ADVB 
FEATURE  ADJ 
FEATURE  ACTVTY 
FEATURE  ALT 
FEATURE  ARRIVE 
FEATURE  BE 
FEATURE  BEFORE 
FEATURE  EOOSTER 
FEATURE  COMM 
FEATURE  CONFIRM 
FEATURE  CONTINUE 
FEATURE  CONJ 
FEATURE  COPULA 
FEATURE  DAYTE 
FEATURE  DEMONS 
FEATURE  DEORBIT 
FEATURE  DEPART 
FEATURE  DEPLOY 
FEATURE  DIR 
FEATURE  ART 
FEATURE  EVAL 
FEATURE  EMOD 
FEATURE  ENROUTE 
FEATURE  FLIGHT 
FEATURE  GO 
FEATURE  HEAD 
FEATURE  HAVE 
FEATURE  IMPACT 
FEATURE  INCL 
FEATURE  INTRANS 
FEATURE  LOC 
FEATURE  LOCATE 
FEATURE  LAND 
FEATURE  LAUNCH 
FEATURE  MISSILE 
FEATURE  MO 
FEATURE  MODAL 
FEATURE  EVENTIVE 
FEATURE  NUM 
FEATURE  NUMMOD 
FEATURE  NATION 
FEATURE  NATO 


FEATURE  N 
FEATURE  OBSERVE 
FEATURE  ORD 
FEATURE  PASTP 
FEATURE  PMOD 
FEATURE  POSPRO 
FEATURE  PREDET 
FEATURE  PRESP 
FEATURE  PRO 
FEATURE  PRTCL 
FEATURE  PREP 
FEATURE  PENETRATE 
FEATURE  QUANT 
FEATURE  REENTRY 
FEATURE  REF 
FEATURE  RETURN 
FEATURE  RELPRO 
FEATURE  REV 
FEATURE  ROBJ 
FEATURE  RSUBJ 
FEATURE  SATELLITE 
FEATURE  SCONJ 
FEATUPE  SERVICE 
FEATURE  SUBNUM 
FEATURE  SUPER 
FEATURE  STAGE 
FEATURE  THATCOMP 
FEATURE  TJPE 
FEATURE  TOCOMP 
FEATUPE  TYME 
FEATURE  TENSED 
FEATURE  TRANS 
FEATURE  UNIT 
FEATURE  VB 
FEATURE  VMOD 
FEATURE  VPASSIVE 
SJ  <  AIR  FORCE)  C  N  3  .  J 
: i  (  AIR  REGIMENT)  t  N  3  .  J 
::  (  AIR  SPACE)  l  N  LOC  3 
: :  (  AL  JAGHBUB)  C  N  LOC  3 
<  A  MINIMUM  OF)  C  NUMMOD  3 
ss  <  ARABIAN  SEA)  t  N  LOC  3 
s:  <  AS  FAR)  l  PREP  EMOD  3 
t:  <  AS  MANY  AS)  C 'NUMMOD  3 
it  <  AT  LEAST)  C  NUMMOD  3  .J 
tt  <  AT  MOST)  C  NUMMOD  3  .J 
s:  C  BARENTS  SEA)  C  N  LOC  3  . ; 
s  J  (  BOMBER  CORPS)  C  N  3 
!S  <  BUFF  A)  C  N  NATO  ACRAFT  3  .1 

s:  <  BUFF  B)  C  N  NATO  ACRAFT  3  .1 

it  (  BUFF  C)  C  N  NATO  ACRAFT  3 

:  t  (  BUFF  D)  C  N  NATO  ACRAFT  3 

ti  (  BUFF  0)  t  N  NATO  ACRAFT  3  ,} 

it  (  CAPE  VERDE  ISLANDS)  t  N  LOC  3 
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t:  <  COMMAND  AND  CONTROL)  C  ADJ  3  ' 

s:  <  COMMAND  AND  SERVICE)  Z  ADJ  3  .  J 

::  <  CONTROL  AND  REPORTING)  C  ADJ  3 

::  (  DEORBITED  AND  RECOVERED)  C  VB  TRANS  TENSED  DEORBIT  3 
*  Z  VB  TRANS  PASTP  DEORBIT  3  . 

::  (  EARTH'S  ATMOSPHERE)  Z  N  LOC  3  .  J 
::  (  EAST  OF)  C  PREP  3  .  ; 

::  (  GULF  OF  ADEN)  I  N  LOC  3  .  ) 

::  (  GULF  OF  AQABA)  C  N  LOC  3 

::  (  HEAVY  BOMBERS)  C  N  ACRAFT  3  .  ; 

::  (  HEAVY  BOMBER)  Z  N  ACRAFT  3 

:  :  (  IN  CONJUNCTION  VI TH )  t  PREP  3 

i:  (  IN  CONNECTION  VI TH)  C  PREP  ■  3 
j:  <  IN  REACTION  TO)  Z  PREP  3  .5 
::  <  INDIAN  OCEAN)  C  N  LOC  3 

::  (  LACCADIVE  ISLANDS)  C  N  LOC  3  .  ) 

s:  (  LAUNCH  SYSTEM)  C  N  BOOSTER  3  .J 

!  :  (  LAUNCH  VEHICLE)  t  N  BOOSTER  3  .) 

::  <  MALDIVE  ISLANDS)  C  N  LOC  3  .) 

::  (  MEDIUM  BOMBER)  t  N  ACRAFT  3  .} 

<  MEDIUM  BOMBERS)  C  N  ACRAFT  1  .) 

:  :  (  MIRAGE  III)  C  N  TJPE  ACRAFT  3  .) 

::  <  NATIONAL  GUARDS)  C  N  3 

::  (  NATIONAL  GUARD)  C  N  3 

::  <  NORTH  OF)  [  PREP  3 

::  <  NORTHEAST  OF)  C  PREP  3 

::  (  ORBITAL  WORKSHOP)  C  N  SATELLITE  3  .  ; 

::  C  OUTSIDE  OF)  C  PREP  3 
j:  <  NORTHVEST  OF)  C  PREP  3 
:  :  <  RED  SEA)  C  N  LOC  3  •  > 

::  C  SAUDI  ARABIAN)  C  ADJ  NATION  3  .J 

:  :  (  SEA  OF  CRISES)  C  N  LOC  3 

:  s  (  SEYCHELLE  ISLANDS)  C  N  LOC  3  .5 

::  (  SEYCHELLE  ISLAND  CHAIN)  C  N  LOC  3 

::  (  SMALL  SCALE)  C  ADJ  3 

s:  (  SOUTH  AFRICAN)  C  ADJ  NATION  3 

::  <  SOUTH  OF)  I  PREP  3  •) 

::  <  SOUTHEAST  OF)  Z  PREP  3  .1 

::  (  SOUTHWEST  OF)  C  PREP  3  .J 

::  (  SOVIET  UNION)  C  N  LOC  NATION  3 

::  <  ST  HELENA)  C  N  LOC  3  .) 

::  (  SPACE  CENTER)  C  N  LOC  3 

::  (  TEST  CENTER)  C  N  LOC  3 

::  (  TEST  RANGE)  Z  N  LOC  3  . J 

::  (  TOOK  PLACE)  C  VB  TENSED  INTRANS  3 

::  (  TAKE  PLACE)  C  VB  TENSED  INTRANS  3 

j:  (  WEST  OF)  C  PREP  3 

::  (  WHITE  SEA)  t  N  LOC  3  J 

s  :  AC  ART  3 

::  A3 13  C  N  SUBNUM  3 

t  :  AA  C  ADJ  3  .  J 

s:  ABOUT  C  A DVB  EVAL  PPEDET  3  C  PREP  EMOD  3 
::  ACFT  C  N  ACRAFT  3 


:  t  ACHIEVE  C  VB  TRANS  3  .3 

it  ACTIVE  t  ADJ  3  C  VB  ACTVTY  3  .3 

t:  ACTIVITY  C  N  EVENTIVE  3  .1 

It  ACTY  C  N  EVENTIVE  3  .3 

It  ADDITIONAL  C  ADJ  REF  3 

si  ADX  C  N  EVENTIVE  3  .3 

IS  ADJ  C  N  LOC  3  .3 

is  AFRICAN  C  ADJ  3  .3 

: t  AFTER  C  PREP  TYME  SCONJ  3  .3 

it  AGAINST  t  PREP  EMOD  3  .3 

ti  AGENCY  C  N  3  .3 

it  AIR-TO- SURFACE  t  ADJ  3  .3 

it  AIR  t  N  3  .3 

it  AIRBORNE  C  ADJ  3  C  VB  FLIGHT  3  .3 
it  AIRCRAFT  t  N  ACRAFT  3  .3 
it  ALEXANDRIA  C  N  LOC  3  .3 
l  l  ALL  C  QUANT  3  . 3 
it  ALONG  C  PREP  EMOD  3  .3 
ti  ALTITUDE  C  N  ALT  3  .3 
It  ALTITUDES  C  N  ALT  3  .3 
It  AN  C  ART  1  .3 
l  t  AND  C  CONJ  3.3 

si  ANNOUNCED  C  VB  TRANS  TENSED  THATCOMP  COMM  3 
C  VB  TRANS  PASTP  THATCOMP  COMM  3  .3 
is  APOLLO  t  N  SATELLITE  3  .3 
is  APPROXIMATELY  t  ADVB  EVAL  PREDET  3  .3 

I  s  APR  C  N  TYME  MO  3  .3 

si  APRIL  C  N  TYME  MO  3  .3 

It  ARE  C  BE  3  C  COPULA  3  C  VB  TRANS  3  .3 
I!  AREA  C  N  LOC  3  .3 
si  AREAS  C  N  LOC  3  .3 

II  ARRIVED  C  VB  PASTP  ARRIVE  3  [  VB  TENSED  ARRIVE  3 
II  AS  C  PRTCL  3  .3 

It  ASSOCIATED  C  ADJ  3  .3 

II  ASM  C  ADJ  3  .3 

is  ASTRONAUT  t  N  3  .3 

IS  ASV  C  ADJ  3  .3 

is  AT  C  PREP  EMOD  TYME  3  .3 

II  ATLANTIC  C  N  LOC  3  .3 

n  ATMOSPHERE  C  N  LOC  3  .3 

is  AUG  C  N  TYME  MO  3  .3 

II  AUGUST  C  N  TYME  MO  3  .3 

is  AUSTRALIA  C  N  LOC  NATION  3  .3 

ii  AUSTRALIAN  C  ADJ  3  .3 

si  AUXILIARY  C  ADJ  3  .3 

is  AVIATION  t  N  EVENTIVE  3  .3 

II  A-4  C  N  TJPE  ACRAFT  3  .3 

II  B-7S  C  N  TJPE  ACRAFT  3  .3 

si  B-753  C  N  TJPE  ACRAFT  3  .3 

it  B- 60  C  N  TJPE  ACRAFT  3  .3 

it  B- 60 '  S  C  N  TJPE  ACRAFT  3  .3 

ii  B-6I  C  N  TJPE  ACRAFT  3  .3 


s:  B-63  C  N  TJPE  ACRAFT  ] 

::  B-63 '  S  C  N  TJPE  ACRAFT  3  .3 
II  B-63S  C  N  TJPE  ACRAFT  3  .3 
::  B- 67 S  t  N  TJPE  ACRAFT  3  .3 
::  B-80  t  N  TJPE  ACRAFT  3  .3 
B-TYPE  C  N  TJPE  ACRAFT  3  .3 
::  B" TYPES  C  N  TJPE  ACRAFT  3  .3 
xx  BACTERIOLOGICAL  t  ADJ  3  .3 
: :  BAIKONUR  C  N  LOC  3  .3 
::  BARFLY  t  N  NATO  ACRAFT  3  .3 
::  BASE  C  N  LOC  3  .J  > 

s:  BASFD  C  PASTP  3  . 3 
::  BC254  t  N  SUBNUM  3  .3 
:  :  BE  C  BE  3  C  COPULA  3  t  VB  TRANS  3 
::  BEACON  t  N  NATO  ACRAFT  3  .3 
:  t  BEAGLE  C  N  NATO  ACRAFT  3  . 3 
it  BED  C  N  3  .J 

: :  BEEN  C  BE  PASTP  3  t  COPULA  PASTP  3  C  VB  TRANS  PASTP  3  .3 

is  BEETLES  C  N  3  .J 

j:  BEFORE  t  PREP  TYME  SCONJ  3 

::  BEING  C  BE  PRESP  3  i  COPULA  PRESP  3  C  VB  TRANS  PRESP  3  .i 

::  BETWEEN  C  PREP  EMOD  TYME  3  .J 

SS  BOMBER  C  N  ACRAFT  3  .J 

s:  BOMBERS  C  N  ACRAFT  3 

::  BOOSTER  C  N  BOOSTER  3  .3 

::  BORDER  t  N  LOC  3  .J 

::  BOUNDED  t  ADJ  3  .J 

:  t  BUFF  C  N  NATO  ACRAFT  3  .} 

::  BUJUMBURA  C  N  LOC  3  .J 
::  BUTTER  C  N  NATO  ACRAFT  3 
::  BY  C  PREP  EMOD  TYME  3  .} 
j:  CANADA  C  N  LOC  3  .} 

::  CAPETOVN-BASED  C  ADJ  3  • } 
xx  CAPETOWN  C  N  LOC  3 
t :  CAPSULE  C  N  3  .3 
t:  CARRYING  C  ’VB  TRANS  PRESP  3 
!!  CENTER  I  N  LOC  3  .J 
t:  CENTRAL  C  ADJ  3  .} 
x x  COAST  C  N  LOC  3  .  J 
::  COLLECTION  C  N  EVENTIVE  3  . J 
::  COMBAT  C  N  EVENTIVE  3  .} 

XX  COMBATANT  C  N  3  .i 
::  COMBATANTS  t  N  3  .J 
XX  COMMUNICATION  C  N  3  .5 
J  t  COMPLEX  C  N  LOC  3  . S 
is  CONDUCTING  C  VB  TRANS  PRESP  3 

it  CONDUCTED  C  VB  TRANS  PASTP  3  C  VB  TRANS  TENSED  3  .! 
it  CONFIRMED  t  VB  TRANS  PASTP  CONFIRM  3 

t  VB  TRANS  TENSED  CONFIRM  3  .3 
I  t  CONGO  C  N  LOC  3  . 3 
Si  CONTAINING  t  VB  TRANS  PRESP  3  .3 

It  CONTINUED  C  VB  TRANS  PASTP  3  C  VB  TRANS  TENSED  3  .3 
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::  CONTINUING  E  VB  TRANS  PRESP  DIR  CONTINUE  3  .  f 

:s  CONTINENTAL  C  ADJ  3 

ss  CONTROLLER  £  N  3 

:s  CONVERTED  E  ADJ  3 

ss  CORNER  E  N  3 

ss  COSMODROME  C  N  LOC  3  .} 

ss  COSMONAUT  C  N  3  .} 

ss  COSMONAUTS  E  N  3 

SS  COSMOS  C  N  SATELLITE  3 

ss  COSMOS-605  C  N  SATELLITE  3  .  7 

ss  COSMOS- 629  C  N  SATELLITE  3  .7 

SS  COSMOS-706  C  N  SATELLITE  3 

ss  COSMOS-722  £  N  SATELLITE  3  .7 

ss  CRAFT  C  N  SATELLITE  3  .7 

SS  CSM  C  N  SATELLITE  3  .J 

ss  CURRENTLY  C  ADVB  REF  TYME  3  .7 

ss  DAMASCUS  C  N  LOC  3  .7 

ss  DATE  t  N  DAYTE  3  .7 

ss  DAY  C  N  DAYTE  3  .7 

ss  DEC  £  N  TYME  MO  3  J 

ss  DECEMBER  C  N  TYME  MO  3  .7 

ss  DEFENSIVE  C  ADJ  3  .7 

ss  DEGREE  C  N  3  .7 

ss  DEGREES  C  N  3  .J 

ss  DELTA-CLASS  £  ADJ  3  .7 

ss  DEORBIT  £  N  EVENTIVE  DEORBIT  3  £  VB  DEORBIT  3  .1 
ss  DEO RBI  TED  £  VB  TRANS  PAST?  DEORBIT  3 

E  VB  TRANS  TENSED  DEORBIT  3  .7 
ss  DEPARTED  £  VB  TRANS  PASTP  DEPART  3 

E  VB  TRANS  TENSED  DEPART  3  .7 
ss  DEPLOYED  E  VB  TRANS  PASTP  DEPLOY  3 

£  VB  TRANS  TENSED  DEPLOY  3  .7 
ss  DEPLOYMENTS  E  N  EVENTIVE  3 
SS  DESTINATION  E  N  3  .7 
s  s  DIVISION  £  N  3  .J 
ss  DJIBOUTI  E  N  LOC  3  .7 

ss  DOWNED  £  VB  TRANS  PASTP  3  E  VB  TRANS  TENSED  3  .7 

ss  DOWNRANGE  E  N  LOC  3  .7 

ss  DURING  E  PREP  EMOD  TYME  3  .7 

SS  El  65  ID  E  N  SUBNUM  3  .7 

ss  EABC  E  N  NATION  -3  .1 

ss  EAFAF  E  N  NATION  SERVICE  3  .7 

ss  EARLIER  C  ADVB  TYME  REF  VMOD  1  .J 

ss  EARLY  £  ADVB  PMOD  3  E  ADJ  TYME  3  E  ADVB  TYME  3  .1 

ss  EARTH  E  N  3  .7 

ss  EAST  E  ADVB  DIR  3  £  ADJ  3 

ss  EASTERN  E  ADJ  3  .  i 

ss  EGYPTIAN  C  ADJ  NATION  3  .1 

ss  EIGHT  C  N  NUM  3  .1 

ss  EQUATOR  E  N  LOC  3  .7 

ss  ENROUTE  C  VB  TOCOMP  ENROUTE  3  .7 


tt  ENTEBBE  C  N  LOC  1  .3 
tt  EQUIPMENT  C  N  3  .3 
tt  ETBF  C  N  LOC  1  .3 
tt  ETHIOPIAN  C  ADJ  1  .3 
tt  EXERCISE  C  N  EVENTIVE  3  .J 

tt  EXPECTED  C  VB  TRANS  TENSED  TOCOMP  EVAL  ROBJ  3 

C  VB  TRANS  PAST?  TOCOMP  EVAL  ROBJ  3  .3 
tt  F-4  C  N  TJPE  ACRAFT  3  .3 
ft  F-4E  C  N  TJPE  ACRAFT  3  .J 
tt  F-5E  C  N  TJPE  ACRAFT  3  . J 

tt  FAILED  C  VB  INTRANS  TENSED  TOCOMP  EVAL  RSUBJ  3 

C  VB  INTRANS  PASTP  TOCOMP  EVAL  RSUBJ  3  .3 
::  FEB  C  N  TYME  MO  3  . 3 
tt  FEBRUARY  C  N  TYME  MO  3  .J 
tt  FERRY  C  N  3  .3 

tt  FIGHTER-BOMBERS  C  N  ACRAFT  3  • ! 
it  FIGHTER  C  N  ACRAFT  3  .3 
::  FIGHTERS  C  N  ACRAFT  3  .3 

tt  FIRED  t  VB  TRANS  PASTP  LAUNCH  3  t  VB  TRANS  TENSED  LAUNCH  3 
t  t  FLEET  t  N  3  . 3 

tt  FLEW  C  VB  TRANS  TENSED  DIR  FLIGHT  3  .3 

tt  FLIGHT  C  N  EVENTIVE  FLIGHT  3  .3 

it  FLIGHTS  C  N  EVENTIVE  FLIGHT  3  .3 

tt  FLOGGER  C  N  NATO  ACRAFT  3  .3 

tt  FODDER  C  N  NATO  ACRAFT  3  .3 

t:  FOLLOWING  C  SCONJ  3  .3 

it  FOR  C  PREP  3  .3 

si  FOUR  C  N  NUM  3  .3 

t:  FRESCO  C  N  NATO  ACRAFT  3  .3 

::  FROM  C  PREP  EMOD  TYME  3  .3 

tt  GENERAL  C  ADJ  3  .3 

::  GROUP  C  N  3  .3 

tt  GUAM  C  N  LOC  3  .3 

it  GULU-BASED  C  ADJ  3  .3 

t  t  GULU  C  N  LOC  3.3 

ft  HAD  I  HAVE  3  C  VB  TRANS  PASTP  3  C  VB  TRANS  TENSED  3  .3 

tt  HAIFA  f  N  LOC  3.3  j 

tt  HAS  C  HAVE  3  C  VB  TRANS  TENSED  3  .3 

tt  HAVE  t  HAVE  3  t  VB  TRANS  3  .3 

tt  HAVING  C  VB  TRANS  PRESP  3  .3 

t:  HEADING  t  VB  PRESP  DIR  HEAD  3  .3 

tt  HERMETICALLY  C  ADVB  3  .3 

tt  HOME8ASE  C  N  LOC  3  .,3 

tt  HOUR  C  N  TYME  UNIT  3  .3 

tt  HOURS  C  N  TYME  UNIT  3  .3 

tt  HR  t  N  TYME  UNIT  3  .3 

tt  IL-2S  C  N  TJPE  ACRAFT  3  .3 

tt  IMPACT  C  N  EVENTIVE  IMPACT  3  C  VB  INTRANS  IMPACT  3  .3 
tt  IMPACTED  t  VB  INTRANS  TENSED  IMPACT  3  .3 
tt  IN  C  PREP  EMOD  3  .3 
tt  INCLINATION  C  N  INCL  3  .3 


i 


i 


INDEPENDENT  t  ADJ  3  .1 

INDICATES  C  VB  TRANS  THATCOMP  3  .3 

INFORMED  C  VB  TRANS  PASTP  THATCOMP  COMM  3 

C  VB  TRANS  TENSED  THATCOMP  COMM  3  .1 
INITIAL  C  ADJ  3  .3 
INTELLIGENCE  C  N  3  .3 
INTO  C  PREP  EMOD  3  .3 
INVOLVING  t  VB  TRANS  PRESP  3  .3 
IRANIAN  C  ADJ  NATION  3  .3 
IS  t  BE  3  t  COPULA  3  i  VB  TRANS  3  .3 
ISRAEL  t  N  LOC  NATION  3  .3 
ISRAELI  C  ADJ  NATION  3  .3 
JAN.  C  N  TYME  MO  3  .3 
JANUARY  C  N  TYME  MO  3  . 3 
JUBA  t  N  LOC  3  .3 
JUN  t  N  TYME  MO  3  .3 
JUNE  C  N  TYME  MO  3  .3 
JUL  C  N  TYME  MO  3  . 3 
JULY  t  N  TYME  MO  3  . 3 
JUST  C  A DVB  PMOD  3  .3 
KALGOORLIE  t  N  LOC  3  .3 
KATHMANDU  C  N  LOC  3  .3 
KB252  C  N  SUBNUM  3  .3 
KE843  C  N  SUBNUM  3  .3 
KENNEDY  C  N  3  .3 
KENYA  C  N  LOC  NATION  3  .3 
KENYAN  C  ADJ  NATION  3  .3 
KFIR  t  N  TJ PE  ACRAFT  3  .3 
KILOMETERS  C  N  LOC  UNIT  3  .3 
KINSHASA  C  N  LOC  3  .3 
KM  C  N  LOC  UNIT  3  .3 
KMS  C  N  LOC  UNIT  3  .3 
LACCADIVES  C  N  LOC  3  .3 

LANDED  t  VB  PASTP  LAND  3  C  VB  TENSED  LAND  3  .3 

LANDMASS  C  N  LOC  3  .3 

LAST  t  ADJ  3  .3 

LATE  £  ADJ  TYME  3  .3 

LATER  C  ADJ  TYME  3  .3 

LAUNCH  C  N  EVENTIVE  LAUNCH  3  .3 

LAUNCHED  C  VB  TRANS  PASTP  LAUNCH  3 

C  VB  TRANS  TENSED  LAUNCH  3  .3 
LAUNCHER  C  N  BOOSTER  3  .3 
LEBANON  t  N  LOC  NATION  3  .3 
LIBYAN  C  ADJ  NATION  3  .J 
LIVING  C  ADJ  3  .3 

LOCATED  t  ADJ  VB  TRANS  PASTP  LOCATE  3 
C  VB  TRANS  TENSED  LOCATE  3  .3 
LUNA- 23  t  N  SATELLITE  3  .3 
LUNAR  C  ADJ  3  .3 
MALAGASY  C  N  LOC  3  .3 
MANEUVERABLE  t  ADJ  3  .3 
MANNED  C  ADJ  3  .3 
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MANY  C  QUANT  3 

MAR  t  N  TYME  MO  3  . 3 

MARCH  C  N  TYME  iO  3 

MARITIME  C  ADJ  3  .3 

MASSAVA  t  N  LOC  3  .3 

MAURITIUS  C  N  LOC  3  .3 

MAY  C  MODAL  3  t  N  TYME  MO  3  .3 

METERS  C  N  LOC  UNIT  3  .3 

MID  C  ADJ  3  .3 

•'TG-17  C  N  TJ PE  ACRAFT  3  .3 

MiG-21  C  N  TJPE  ACRAFT  3  .1 

MIG-23  C  N  TJPE  ACRAFT  3  .3 

MILES  C  N  LOC  UNIT  3  .3 

MILITARY  C  ADJ  3  .3 

MINUTE  C  N  TYME  UNIT  3 

MINUTES  I  N  TYME  UNIT  3  .3 

MISSILE  C  N  MISSILE  3  .3 

MISSILES  t  N  MISSILE  3  .3 

MISSION  C  N  EVENTIVE  3  .3 

ML-28  I  N  TJPE  ACRAFT  3  . S 

MODIFIED  t  ADJ  3 

MODULE  C  N  SATELLITE  3 

MOGADISHU  t  N  LOC  3  .i 

MOMBASA-BASED  t  ADJ  3  .3 

MOMBASA  C  N  LOC  3  . 

MONTH  C  N  TYME  3  .J 
MOON  C  N  LOC  3  .3 
MORNING  C  N  TYME  3 
MOST  C  QUANT  3 
MUSHROOM  C  N  3  .3 
NAIROBI -BASED  C  ADJ  3  .3 
NAIROBI  C  N  LOC  3  .3 
NASA  C  N  NATION  3  .3 
NATIONAL  C  ADJ  3  .3 
NATURE  C  N  3  .3 
NAUTICAL  t  ADJ  3  .3 
NAVIGATIONAL  C  ADJ  3  .3 
NEAR  C  PREP  EMOD  3  .3 
NEPAL  C  N  LOC  NATION  3  .3 
NEWS  C  N  3  .3 

NIGERIAN  C  ADJ  NATION  3  .3 

NINE  t  N  NUM  3  .3 

NM  C  N  LOC  UNIT  3  .3 

NMS  t  N  LOC  UNIT  3  .3 

NO  C  QUANT  3  .3 

NORMAL  C  ADJ  3  .3 

NORTH  t  ADVB  DIR  I  t  ADJ  3  .3 

NORTHEAST  t  ADVB  DI R  3  C  ADJ  3  .3 

NORTHERN  C  ADJ  3  .3 

NORTHWEST  C  ADVB"  DIR  3  C  ADJ  3  .3 

NORTHWESTERN  f  ADJ  3  .3 
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3  NOTED  t  VB  TRANS  PASTP  OBSERVE  3  C  VB  TRANS  TENSED  OBSERVE  3  .3 
s  NOV  C  N  TYME  MO  3  .3 
3  NOVEMBER  t  N  TYME  MO  3  .3 
3  NYANJA  t  N  LOC  3  .3 
t  OCCUR  C  VB  INTRANS  3  .3 
:  OCCURRED  C  VB  TENSED  INTRANS  3  .3 
:  OCEAN  C  N  LOC  3  .  3 
3  OCT  C  N  TYME  MO  3  .3 
I  OCTOBER  C  N  TYME  MO  3  .3 
:  OF  t  RRER  3  .3 
J  ON  C  PRER  EMOD  DAYTE  3  .3 
S  ONE  t  N  NUM  3  .3 
:  OPEN  t  ADJ  3  .3 

:  OPERATED  C  VB  PASTP  ACTVTY  3  C  VB  TENSED  ACTVTY  3  .3 
1  OPERATING  C  VB  PRESP  ACTVTY  3  .3 
s  OPERATIONS  C  N  EVENTIVE  3  .3 
:  ORBIT  t  N  LOC  3  .3 
»  ORBITAL  C  ADJ  3  .3 
1  OUTBACK  C  N  LOC  3  .3 
I  OVER  C  PREP  EMOD  3  .3 
t  PACIFIC  t  N  LOC  3  .3 

s  PENETRATED  C  VB  TRANS  PASTP  PENETRATE  3 

C  VB  TRANS  TENSED  PENETRATE  3  .3 
S  PENETRATION  [  N  EVENTIVE  3  .3 
:  PERFORMING  C  VB  TRANS  PRESP  ACTVTY  3  .3 
s  PERIOD  C  N  TYME  3  .  ? 

:  PERTH  C  N  LOC  3  . 3 
3  PHANTOM  C  N  NATO  ACRAFT  3  .3 
:  PILOTS  t  N  3  .3 
J  PLESETSK  t  N  LOC  3  .3 
J  POINT  t  N  LOC  3  .3 
t  PORTION  [  N  3  .3 
i  POSSIBLE  t  ADJ  EVAL  3  .3 
«  POSSIBLY  C  ADVB  EVAL  3  .3 

»  PRECEDED  C  VB  TRANS  PASTP  BEFORE  3  C  VB  TRANS  TENSED 

BEFORE  3  .3 

t  PRECEDES  t  VB  TRANS  TENSED  BEFORE  3  .3 
t  PRESENTLY  [  ADVB  REF  TYME  3  . 3 
:  PRETORIA-BASED  C  ADJ  3  .3 
»  PRETORIA  [  N  LOC  3  .3 
3.  PREVIOUS  t  ADJ  REF  TYME  3  .3 
3  PREVIOUSLY  C  ADVB  REF  TYME  3  .3 
1  PRIMARILY  t  ADVB  EVAL  3  .3 
3  PROBABLE  C  ADJ  EVAL  3  .3 
3  PROBABLY  t  ADVB  EVAL  PMOD  3  .3 
3  PROCEEDING  C  VB  PRESP  DIR  3  .3 
3  RATS  t  N  3  .3 

3  RECONNAISSANCE  C  N  EVENTIVE  3  .3 
3  RECOVERABLE  t  ADJ  3  .3 
3  RECOVERY  t  N  3  .3 
3  REENTER  C  VB  TRANS  REENTRY  3  .3 
«  RE-ENTERED  t  VB  TRANS  TENSED  REENTRY  3  .3 
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t  REENTERED  t  VB  TRANS  TENSED  REENTRY  3  .3 
REENTRY  C  N  EVENT1VE  REENTRY  3  .3 
REGT  C  N  3  . 3 
REGIMENT  C  N  1  .3 
REGIMENTS  C  N  1  • t  + 

REGION  t  N  LOC  1  .3 
REMAIN  C  COPULA  TENSED  3  .3 
REMAINED  Z  COPULA  TENSED  3  .3 
REPORT  C  N  3  .3 

REPORTING  C  VB  TRANS  PRESP  3  .3 
REPRESENTING  C  VB  TRANS  PRESP  3  .3 

RETURNED  [  VB  PASTP  RETURN  3  t  VB  TENSED  RETURN  3  .3 
RETURNING  t  VB  PRESP  RETURN  3  .3 
REVOLUTION  C  N  REV  3  .3 
RGT  C  N  3  .3 
RIYADH  t  N  LOC  3  .3 
ROUTINE  C  ADJ  3  .3 
ROUTINELY  Z  A DVB  3  .3 
S1234B  C  N  SUBNUM  3  .3 
‘  SA554  C  N  SUBNUM  3  .3 
SA622  C  N  SUBNUM  3  .3 
SAFAF  C  N  NATION  SERVICE  3  .3 
SAFLT  C  N  NATION  SERVICE  3  .3 
SATELLITE  C  N  SATELLITE  3  .3 
SATELLITES  C  N  SATELLITE  3  .3 
SATURN- IB  t  N  BOOSTER  3  .3 
SATURN- 5  Z  N  BOOSTER  3  .3 
SAM-3  t  N  MISSILE  3  .3 
SAME  C  ADJ  3  .3 
SC462  I  N  SUBNUM  3  .3 
SCRAMBLED  C  ADJ  3  .5 
SEALED  t  ADJ  3  .3 
SEP  C  N  TYME  MO  3  .3 
SEPTEMBER  C  N  TYME  MO  3  .3 
SEYCHELLES  t  N  LOC  3  .3 
SIBERIA  t  N  LOC  3  .3 
SIMULATED  C  ADJ  3  .3 
SINCE  C  PREP  EMOD  TYME  3  .3 
SIVAH  [  N  LOC  3  .3 
SIX  t  N  NUM  3  .3 
SKYHAVK  Z  N  NATO  ACRAFT  3  .3 
SKYLAB  Z  N  SATELLITE  3  .3 

SOFTLANDED  Z  VB  TRANS  PASTP  LAND  3  Z  VB  TRANS  TENSED  LAND  3  .3 

SOMALIA  C  N  LOC  NATION  3  .3 

SOME  I  QUANT  3  .3 

SOUTH  C  A  DVB  DIR  3  t  ADJ  3  .3 

SOUTHERN  Z  ADJ  3  .3 

SOUTHEAST  C  A  DVB  DIR  3  t  ADJ  3  .3 

SOUTHWEST  C  A  DVB  DI R  3  Z  ADJ  3  .3 

SOUTHWESTERN  Z  ADJ  3  .3 

SOVIET  t  ADJ  NATION  3  .3 

SOYUZ  C  N  SATELLITE  3  .3 
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SOYUZ-22  t  N  SATELLITE- 1  .3 
SOYUZ-28  t  N  SATELLITE  3  .3 
SOYUZ-TYPE  C  ADJ  SATELLITE  1  .  I 
SP265  t  N  SUBNUM  3  •} 

SPACE  t  N  LOC  3  .3 
SPACECRAFT  C  N  SATELLITE  3  .3 
SPACEFLIGHT  C  N  FLIGHT  3  .3 
SPORES  C  N  3  .3 
SR-71  C  N  TJPE  ACRAFT  3*.; 

SS-il  Z  N  MISSILE  3  .1 
S-4B  C  N  3  .3 
STAGE  C  N  3  .3 

STAGING  C  VB  PRESP  STAGE  3  .3 

STATEMENT  C  N  3  .3 

STATION  C  N  LOC  3  .3 

STRATEGIC  Z  ADJ  3  .3 

STRIKE  t  N  EVENTIVE  1  *3 

STRIKES  C  N  EVENTIVE  3  .3 

SUAM  C  N  LOC  3  .3 

SUBMARINE  C  N  3  .3 

SUBORDINATE  C  ADJ  3  .3 

SUCCESSFULLY  Z  A DVB  EVAL  3  .3 

SUDAN  C  N  LOC  3  .3 

SUDANESE  C  ADJ  1  .3 

SUGGESTS  C  VB  TRANS  THATCOMP  3  .3 

SUPPORT  C  N  EVENTIVE  3  .3 

SURFACE-TO-AIR  Z  ADJ  3  .3 

SURFACE  Z  N  LOC  3  .3 

SURGUT  Z  N  LOC  3  .3 

SURVEILLANCE  [  N  EVENTIVE  3  .3 

SYRIAN  C  ADJ  NATION  3.3 

TAIPEI  C  N  LOC  3  .3 

TAIWAN  [  N  LOC  NATION  3  .3 

TASK  C  N  EVENTIVE  3  .3 

TASS  C  N  3  .3 

TEN  C  N  NUM  3  .3 

TESTING  Z  VB  TRANS  PRESP  3  .3 

THAT  C  CONJ  3  C  RELPRO  3  Z  ART  REF  3  .3 

THE  t  ART  3  .5 

THESE  t  ART  REF  3  .3 

THEY  Z  PRO  3  .3 

THEIR  Z  ART  POSPRO  3  .3 

THIRD  [  ORD  ADJ  3  .3 

THIS  Z  ART  REF  DEMONS  3  .3 

THOSE  t  ART  REF  3  .3 

THREE  Z  N  NUM  3  .3 

TIME  C  N  TYME  3  .3 

TO  C  PREP  EMOD  3  .3 

TOBRUK  C  N  LOC  3  .3 

TODAY  C  A DVB  REF  TYME  VMOD  DAYTE  3  .3 
TORORO  Z  N  LOC  3  .3 
TORTOISES  Z  N  3  .3 
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tt  TRACKING  E  ADJ  3  .3 
it  TRAINING  E  N  3  .3 

tt  TURNED  t  VB  PASTP  DIR  3  E  VB  TENSED  DIR  3  . 

s:  TURNING  E  VB  PRESP  DIR  3  .i 

it  TU-9  5  E  N  TJPE  ACRAFT  3  .3 

: :  TWO  C  N  NUM  3  . i 

: s  TYPE  E  N  3  .  i 

s:  TYRE  t  N  LOC  3  .1 

: :  TYURATAM  E  N  LOC  3  .3 

::  U-43  C  N  TJPE  ACRAFT  *i  .3 

it  U1009B  t  N  SUBNUM  3 

: s  U1 21 1 B  t  N  SUBNUM  3  .3 

ss  U1232  E  N  SUBNUM  3  .i 

: :  U1324B  E  N  SUBNUM  3  .  3 

S  t  UABC  E  N  3  . 3 

ss  UAF  E  N  3  .3 

: :  UBBC  E  N  3  . 3 

tt  UG254  C  N  SUBNUM  3  .3 

a  UG836C  C  N  SUBNUM  3 

::  UGANDA  C  N  LOC  NATION  3 

ss  UGANDAN  E  ADJ  NATION  3 

SS  UNDERWAY  C  VB  FLIGHT  3 

ss  UNDETERMINED  E  ADJ  3  .i 

s:  UNIDENTIFIED  C  ADJ  3 

:s  UNITS  C  N  3 

s  s  VARIOUS  C  ADJ  3  .i 

ss  VEHICLE  C  N  SATELLITE  MISSILE  3 

ss  VICINITY  t  N  3 

ss  VIOLATED  C  VB  TRANS  PASTP  PENETRATE  3 

C  VB  TRANS  TENSED  PENETRATE  3  .1 
SS  WAS  C  BE  3  C  COPULA  3  C  VB  TRANS  3 
ss  WEATHER  C  N  3  .} 

tt  WERE  t  BE  3  C  COPULA  3  E  VB  TRANS  3 

SS  WEST  C  ADVB  DI R  3  C  ADJ  3 

SS  WESTERN  C  ADJ  3 

ss  WESTWARD  C  ADVB  DIR  3 

ss  WHICH  C  RELPRO  PRO  3 

it  WOULD  C  MODAL  3  .1 

SS  X  t  N  LOC  3 

ss  XB442  X  N  SUBNUM  3  .3 

SS  XB262  C  N  SUBNUM  3  .3 

SS  YEAR  C  N  DAYTE  3  .3 

tt  ZEILA  E  N  LOC  3  .3 

ENDLEX 

MCR>BYES 
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APPENDIX  C 


Templates  and  Auxiliary 
Procedures. 
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*****  MASTER  Template  and  auxiliaries  as  of  Aus  10>  1979.  *#### 
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APPENDIX  :  D 


PATTERN  N0 
tS  N  C 

: A  0123456789  ■  >  N1  ,, 

;; 

tS  N 1 

t  A  0123456789  »>  N2  ,, 
i A  S  “>  ORDS  n 
tA  N  »>  ORDN  i, 

< A  R  ■>  ORDR  a 
lA  T  ■>  ORDT  a 
IF  * BLANK*  t  N  NUM  1 D1 G  1  n 
tF  -,/t  C  N  NUM  1DIG  3  n 
it 

tS  N2 

tA  0123456789  >>  N3  // 
tA  S  ■>  N2S  ii 
tA  N  ■>  N2N  ii 
t A  R  ■»  ORDR  a 
tA  T  «»  ORDT  i, 
tF  * BLANK*  C  N  NUM  2DIG  3  ,, 
tF  -i/i  C  N  NUM  2DIG  3  ,, 
li 

tS  N3 

tA  0123456789  ■>  N4  ,, 
tA  S  ■»  ORDS  a 
tA  N  ■»  ORDN  a 
tA  R  *>  ORDR  a 
tA  T  •>  ORDT  a 
tF  * BLANK*  C  N  NUM  3D!  G  3 
tF  -«/t  C  N  NUM  3DIG  3  n 
n 

tS  N4 

tA  0123456789  »>  N5  ## 
tA  Z  ■»  TMF  a 
tA  S  *>  N4S  a 
tA  N  •>  NAN  a 
tA  R  »>  ORDR  a 
tA  T  «>  ORDT  a 
tF  * BLANK*  C  N  NUM  4DX  G  3  i* 
tF  - i/t  C  N  NUM  4D1G  3  n 
It 


D-2 


/ 


tS  N5 

t  A  0123456789  ->  N6  jj 

l A  5  ■>  ORDS  i< 

t A  N  »»  ORDN  jj 

lA  R  «>  ORDR  jj 

tA  T  ■>  ORDT  < < 

tF  * BLANK*  C  N  NUM  5DIG  3 

«F  -j/t  C  N  NUM  5D1G  3  jj 


tS  N6 

tA  0123456789  ->  NF  jj 
tA  Z  ■>  TMF  jj 
tA  S  »  N6S  jj 
tA  N  ->  N6N  jj 
tA  R  »>  ORDR  jj 
tA  T  ■>  ORDT  jj 
tF  * BLANK*  C  N  NUM  6DI G  3 
tF  -j/t  C  N  NUM  6D1G  3  jj 
JJ 


tS  NF 

tA  0123456789  -»  NF  ,, 

tA  S  ->  ORDS  ** 

tA  N  ■>  ORDN  «« 

lA  R  »>  ORDR  j  j 

tA  T  ■>  ORDT  j.j 

tF  * BLANK*  C  N  NUM  3  jj 

tF  -j/t  C  N  NUM  3  jj 

;; 


tS  TMF 

tF  * BLANK*  C  N  TYME  3  jj 
tF  -j/t  t  N  TYME  3  jj 

;; 


tS  ORDS 

tA  T  ■>  ORDF  jj 

n 


tS  ORDN 

tA  D  ■>  ORDF  jj 

;; 


tS  ordr 

tA  D  ■>  ORDF  t » 

•  m 
9  9 


IS  ORDT 

l A  H  ■>  ORDF  ,, 

a 


:S  ORDF 

tF  * BLANK*  C  NUM  ORD  3 
t  F  -,/t  C  NUM  ORD  1  ,, 

;; 


tS  N2S 

t  A  T  ■»  ORDF  »* 
tA  01 23456789  «>  2LN1 
i  i 


tS  N2N 

t A  D  ->  ORDF  ,, 
tA  0123456789  ->  2LN1 

n 


tS  2LN 1 

tA  0123456789  ■  >  2LN2 
it 


tS  2LN2 

tA  EV  ->  2LF  ,, 
it 


tS  2LF 

tF  * BLANK*  C  N  LOC  3  » 
tF  -,/t  C  N  LOC  3 
it 


tS  N4S 

tA  T  ■>  ORDF  ,, 
tA  0123456789  »  4LN1 


it 


IS  NAN 

i A  D  ■>  ORDF  tt 
« A  0 1 23456789  «>  4LN1 
it 


tS  4LN1 

tA  0123456789  »  4LN2 

;; 


tS  4LN2 

l A  0123456789  ■  >  4LN3 
it 


tS  4LN3 

: A  0123456789  »  4LN4 
it 


tS  4LN4 

l A  EW  *>  4LF  ,, 
it 


tS  4LF 

IF  * BLANK*  C  N  LOC  3  * 
IF  -,/t  C  N  LOC  3  ,, 
it 


l  S  N6S 

l A  T  ->  ORDF  ,, 
tA  0123456789  »  6LN1 
it 


tS  N6N 

lA  D  •>  ORDF  tt 
t A  0123456789  ■  >  6LN1 
it 


tS  6LN1 

tA  0123456789  ■  >  6LN2 
it 


tS  6LN2 

lA  0123456789  »  6LN3 

;; 


tS  6LN3 

tA  0123456789  ->  6LN4 


tS  6LN4 

tA  0123456789  »  6LN5 
ll 


tS  6LN5 

i A  0123456789  a>  6LN6 

1 1 


tS  6LN6 

tA  EV  ->  6LF 
l  « 


tS  6LF 

tF  * BLANK*  C  N  LOC  3  # 
IF  >*/t  C  N  LOC  3 
ll 


ENDPATTEBN 
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Event:  LAUNCH 
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LAUNCH 


NASA  ANNOUNCED  THAT  THE  IMPACT  OF  SKYLAB  TOOK  PLACE 
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IMPACT  PROBABLY  OCCURRED  IN  AN.  AREA  ABOUT  100  KMS  FROM 
GOORLIE. 

IMPACT  OP  SKYLAB  WAS  PROBABLY  IN  AN  AREA 
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»  THE  SOVIET  NEWS  AGENCY  TASS  ANNOUNCED  THAT 
SKYLAB  DEORBITED  INTO  THE  INDIAN  OCEAN 
ON  THE  34981ST  REVOLUTION. 
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»  DEORBIT  OF  SKYLAB  OCCURRED  OVER  CANADA  EARLY  ON 
REVOLUTION  34981. 

»  DEORBIT  TOOK  PLACE  EARLY  ON  REVOLUTION  34981. 

»  NASA  DEORBITED  THE  SATELLITE  INTO  THE 
INDIAN  OCEAN  ON  REVOLUTION  123. 


»  THE::  SATELLITE  WAS  DEORBITED  BY  NASA  INTO 
THE  INDIAN  OCEAN  ON  REVOLUTION  123. 

»  THE  SATELLITE  WAS  EXPECTED  TO  DEORBIT  INTO  THE 
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Action-  DEORBIT 
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MISSION  l 

°f  \ 

Rome  Air  Devefopment  Center  ^ 

WK  plan*  and  executes  xeseaxch,  development,  test  and  * 

detected  acquisition  programs  in  support  of  Command,  Control  S 
Communications  and  Intelligence  (C3I)  activities.  Technical  - 
and  engineering  support  within  areas  o&  technical  competence  C 
is  provided  to  ESP  Program  OUices  (POs)  and  other  ESP  i 

elements.  The  principal  technical  mission  areas  axe  ✓ 

communications,  electromagnetic  guidance  and  contxol,  sux-  < 
veUlance  of  gxound  and  aexospace  objects,  intelligence  data  * 
collection  and  handling,  information  system  technology,  > 

ionospheric  propagation,  solid  state  sciences,  microwave  l 

physics  and  electronic  reliability,  maintainability  and  r 

compatibility .  <1 


